ONV-080.01 



Methods and Compositions for Regulating Lymphocyte Activity 



Priority Information 

This application claims priority to U.S. Provisional Application 60/168,1 12, filed 
November 30, 1999. 

Background of the Invention 

The lymphoid system performs critical functions in animals that include preventing and 
combating infection, and surveillance and immune elimination of tumor cells. Loss of immune 
function leads to an immunocompromised status that can predispose the host to serious and 
life-threatening disease. Functional abnormalities may be present in any of the elements that 
participate in mediating an immune response, e.g., cellular or humoral elements such as 
granulocytes, lymphocytes, complement, antibody, or cytokines^ 

The immune system is a network of cells adapted to protect the organism against 
pathogens and cells that are not recognized as "self." Once the immune system is activated, it 
enlists the participation of a variety of cells and molecules to mount an effector function 
designed to eliminate the "non-self 1 entity within the body. Lymphocytes are cells of the 
immune system that are capable of specifically recognizing and selectively eliminating foreign 
entities. By contrast to other cells of the immune system, such as neutrophils which are 
considered non-specific in their reactions to invaders, the characteristics of lymphocytes confer 
specificity, diversity, memory and self7nonself recognition to the immune response. 

There are two major populations of lymphocytes: B lymphocytes and T lymphocytes. B 
lymphocytes originate and mature within the bone marrow and are responsible for formation of 
antibody molecules. T lymphocytes also arise from the bone marrow but mature in the thymus. 
There are two major subpopulations of T-cells: T helper cells and T cytotoxic cells. The two 
types of T cells can be distinguished by the presence of one of two membrane glycoproteins, 
either CD4 or CD8. The T-helper cells (which express CD4) when activated by antigen- 
complexes (foreign molecules coupled to special proteins) respond by secreting various growth 
factors known collectively as cytokines. These cytokines are signals that activate other cells of 
the immune system, including the T-cytotoxic cells. The T-cytotoxic cells (which express 
CD8) when activated, proliferate and differentiate into cytotoxic T lymphocytes (CTL) which 
are able to monitor for and eliminate from the body pathogenic cells, foreign cells, virus- 
infected cells, and tumor cells. 
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The normal development, maturation and differentiation of T lymphocytes are regulated 
by various peptide factors 



5 Summary of the Invention 

The present invention provides methods and compositions for modulating the immune 
function of an animal by administering an agent which mimics, or alternatively inhibits, 
hedgehog-medi&ted signal transduction. Thus, the subject method can be used as an 
immunosuppressive, e.g., by administering a hedgehog agonist, or as an immunostimulatant, e.g., 
10 by administering a hedgehog antagonist 

Wherein the subject method is carried out using a hedgehog therapeutic, the hedgehog 

□ therapeutic preferably a polypeptide including a hedgehog portion comprising at least a bioactive 
^ extracellular portion of a hedgehog protein, e.g., the hedgehog portion includes at least 50, 100 
pj or 150 (contiguous) amino acid residues of an N-terminal half of a hedgehog protein.In preferred 
+; 15 embodiments, the hedgehog portion includes at least a portion of the hedgehog protein 
m corresponding to a 19kd fragment of the extracellular domain of a hedgehog protein. 

^ In preferred embodiments, the hedgehog portion has an amino acid sequence at least 60, 

M= 75, 85, or 95 percent identical with a hedgehog protein of any of SEQ ID Nos. 10-18 or 20, 

!7 S though sequences identical to those sequence listing entries are also contemplated as useful in the 

a 20 present method. The hedgehog portion can be encoded by a nucleic acid which hybridizes under 

□ stringent conditions to a nucleic acid sequence of any of SEQ ID Nos. 1-9 or 19, e.g., the 

Q 

hedgehog portion can be encoded by a vertebrate hedgehog gene, especially a human hedgehog 
gene. 

In other embodiments, the subject method can be carried out by administering a gene 
25 activation construct, wherein the gene activation construct is deigned to recombine with a 
genomic hedgehog gene of the patient to provide a heterologous transcriptional regulatory 
sequence operatively linked to a coding sequence of the hedgehog gene. 

In still other embodiments, the subject method can be practiced with the administration of 
a gene therapy construct encoding a hedgehog polypeptide. For instance, the gene therapy 
30 construct can be provided in a composition selected from a group consisting of a recombinant 
viral particle, a liposome, and a poly-cationic nucleic acid binding agent, 
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In yet other embodiments, the subject method can be carried out using a ptc therapeutic. 
An exemplary ptc therapeutic is a small organic molecule which binds to a patched protein and 
derepresses patched-mediated inhibition of mitosis, e.g., a molecule which binds to patched and 
mimics hedgehog-mediated patched signal transduction, which binds to patched and regulates 
5 patched-dependent gene expression. For instance, the binding of the ptc therapeutic to patched 
may result in upregulation of patched and/or gli expression. 

In a more generic sense, the ptc therapeutic can be a small organic molecule which 
interacts with MK cells to induce hedgehog-mediated patched signal transduction, such as by 
altering the localization, protein-protein binding and/or enzymatic activity of an intracellular 
10 protein involved in a patched signal pathway. For instance, the ptc therapeutic may alter the 
level of expression of a hedgehog protein, a patched protein or a protein involved in the 
intracellular signal transduction pathway of patched. 

O In certain embodiments, the ptc therapeutic is an antisense construct which inhibits the 

expression of a protein which is involved in the signal transduction pathway of patched and the 

ftj 15 expression of which antagonizes hedgehog-mediated signals. The antisense construct is 
perferably an oligonucleotide of about 20-30 nucleotides in length and having a GC content of at 

m least 50 percent. 

J In other embodiments, the ptc therapeutic is an inhibitor of protein kinase A (PKA), such 

H as a 5-isoquinolinesulfonamide. The PKA inhibitor can be a cyclic AMP analog. Exemplary 
20 PKA inhibitors include N-[2-((p-bromocinnamyl)amino)ethyl]-5-isoquinolinesulfonamide, l-(5- 
m isoquinoline-sulfonyl)-2-methylpiperazine, KT5720, 8-bromo-cAMP, dibutyryl-cAMP and PKA 
y Heat Stable Inhibitor isoform a. Another exemplary PKA inhibitor is represented in the general 
~~ formula: 



R2^ /R1 
N 
I 

o=s=o 




25 wherein, 

Rj and R 2 each can independently represent hydrogen, and as valence and stability permit 
a lower alkyl, a lower alkenyl, a lower alkynyl, a carbonyl (such as a carboxyl, an ester, a 
formate, or a ketone), a thiocarbonyl (such as a thioester, a thioacetate, or a thioformate), an 
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amino, an acylamino, an amido, a cyano, a nitro, an azido, a sulfate, a sulfonate, a sulfonamido, 
-(CH 2 ) m -R 8 , -(CH 2 ) m -OH, -(CH 2 ) m -0-lower alkyl, -(CH 2 ) m -0-lower alkenyl, -(CH 2 ) n -0- 
(CH 2 ) m -R 8 , -(CH 2 ) m -SH, -(CH 2 ) m -S-lower alkyl, -(CH 2 ) m -S-lower alkenyl, -(CH 2 ) n -S-(CH 2 ) m - 
R 8 , or 

5 R\ and R 2 taken together with N form a heterocycle (substituted or unsubstituted); 

R3 is absent or represents one or more substitutions to the isoquinoline ring such as a 
lower alkyl, a lower alkenyl, a lower alkynyl, a carbonyl (such as a carboxyl, an ester, a formate, 
or a ketone), a thiocarbonyl (such as a thioester, a thioacetate, or a thioformate), an amino, an 
acylamino, an amido, a cyano, a nitro, an azido, a sulfate, a sulfonate, a sulfonamido, -(CH 2 ) m - 
10 R 8 , -(CH 2 ) m .OH, -(CH 2 ) m -0-lower alkyl, -(CH 2 ) m -0-lower alkenyl, -(CH 2 ) n -0-(CH 2 ) m -R 8 , 
-(CH 2 ) m -SH, -(CH 2 ) m -S-lower alkyl, -(CH 2 ) m -S-lower alkenyl, .(CH 2 ) n -S-(CH 2 ) m -R 8 ; 

R 8 represents a substituted or unsubstituted aryl, aralkyl, cycloalkyl, cycloalkenyl, or 
% heterocycle; and 

psl n and m are independently for each occurrence zero or an integer in the range of 1 to 6. 

S 15 

*{; Brief Description of the Figures 

_ r*" 

f Figures 1-4 illustrate the profile for staining with various markers for T-cell maturity. 

KJ Detailed Description of the Invention 

O 

p 20 L Overview 

The present application is directed to the discovery that hedgehog gene products are able 
to regulate maturation of T lymphocytes. Certain aspects of the invention are directed to 
preparations of hedgehog polypeptides, or other molecules which regulate patched or 
smoothened signalling, and their uses as immunomodulatory agents against both acquired and 
25 hereditary immunological disorders. 

For instance, such compositions can be used to increase the population of T-helper cells 
to optimum levels in the host, e.g, to stimulate the immune system of the animal. Such uses of 
the subject compoistions can be used in the treatment of bacterial or viral infections, as well as to 
help the body fight against cancer cells. Alternatively, these substances also enable the host to 
30 adjust to diseases arising from disarrangement of self-recognition processes in which there is 
excessive attack by host T-cells against endogenous tissues. In such instances, the subject 
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compositions can be used to reduce T-cell population so that the signs and symptoms of self- 
directed inflammatory (autoimmune) diseases such rheumatoid arthritis and multiple sclerosis are 
ameliorated. 

As described herein, hedgehog proteins inhibit maturation of T lymphocytes. Based upon 
5 its inhibitory effect, the administration of hedgehog or ptc therapeutics (defined infra ) is 
suggested herein as a treatment for several types of immunological disorders involving unwanted 
activation of cellular immunity, e.g., graft rejection, autoimmune disorders, and the like. In other 
embodiments, inhibitors of hedgehog signalling pathways can be used as immunostimulatory 
agents, e.g., to counteract the effects of any endogenous activation of a hedgehog/ptc pathway. 

10 In general, the method of the present invention comprises administering to animal, or to 

cultured lymphocytes in vitro, an amount of a hedgehog or ptc therapeutic which produces a non- 
toxic response by the cell of inhibition of maturation (in the case of a hedgehog agonist) or 
O promotion of T cell maturation (in the case of a hedgehog antagonist). The subject method can be 
^ carried out on cells which may be either dispersed in culture or a part of an intact tissue or organ, 
fy 15 Moreover, the method can be performed on cells which are provided in culture (in vitro), or on 
cells in a whole animal (in vivo). 

In one aspect, the present invention provides pharmaceutical preparations and methods 
J for treating or preventing hyperimmune or hypoimmune disorders utilizing, as an active 

u ingredient, a hedgehog polypeptide or a mimetic thereof. The invention also relates to methods 

L*. 

^ 20 of controlling the functional performance of T cells by use of the pharmaceutical preparations of 
gy the invention. 

i c 

5 The subject hedgehog treatments are effective on both human and animal subjects afflicted 

with these conditions. Animal subjects to which the invention is applicable extend to both 
domestic animals and livestock, raised either as pets or for commercial purposes. Examples are 
25 dogs, cats, cattle, horses, sheep, hogs and goats. 

Without wishing to be bound by any particular theory, the inhibitory effect of hedgehog 
on T cell maturation may be due at least in part to the ability of hedgehog proteins to antagonize 
(directly or indirectly) patched-mediated regulation of gene expression and other physiological 
effects mediated by that protein. The patched gene product, a cell surface protein, is understood 
30 to signal through a pathway which causes transcriptional repression of members of the Wnt and 
Dpp/BMP families of morphogens, proteins which impart positional information. In other tissue, 
the introduction of hedgehog relieves (derepresses) this inhibition conferred by patched, allowing 
expression of particular gene programs. 
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Recently, it has been reported that mutations in the human version of patched, a gene first 
identified in a fruit fly developmental pathway, cause a hereditary skin cancer and may contribute 
to sporadic skin cancers. See, for example, Hahn et al. (1996) Cell 86:841-851; and Johnson et 
al. (1996) Science 272:1668-1671. The demonstraction that nevoid basal-cell carcinoma 
5 (NBCC) results from mutations in the human patched gene provided an example of the roles 
patched plays in post-embryonic deveolpment. These observations have led the art to understand 
one activity of patched to be a tumor suppressor gene, which may act by inhibiting proliferative 
signals from hedgehog. Our observations set forth below reveal potential new roles for the 
hedgehog! patched pathway in maintenance of mature T-cells and other lyphocytic cell lines. 
10 Accordingly, the present invention contemplates the use of other agents which are capable of 
mimicking or antagonizing, depending on the desired consequence, the effect of the hedgehog 
protein on patched signalling, e.g., as may be identified from the drug screening assays described 
below. 



lU 15 IL Definitions 



For convience, certain terms employed in the specfication, examples, and appended 
claims are collected here. 



f . The term "hedgehog therapeutic" refers to various forms of hedgehog polypeptides, as 

I—I. 

well as peptidomimetics, which can modulate the proliferation/differentiation state of 
[U 20 lymphocytes, by, as will be clear from the context of individual examples, mimicing or 
£[ potentiating (agonizing) or inhibiting (antagonizing) the effects of a naturally-occurring 
p hedgehog protein. A hedgehog therapeutic which mimics or potentiates the activity of a wild- 

type hedgehog protein is a "hedgehog agonist". Conversely, a hedgehog therapeutic which 
inhibits the activity of a wild-type hedgehog protein is a "hedgehog antagonist". 

25 In particular, the term "hedgehog polypeptide" encompasses preparations of hedgehog 

proteins and peptidyl fragments thereof, both agonist and antagonist forms as the specific context 
will make clear. 

As used herein the term "bioactive fragment of a hedgehog protein" refers to a fragment 
of a full-length hedgehog polypeptide, wherein the fragment specifically agonizes or antagonizes 
30 inductive events mediated by wild-type hedgehog proteins. The hedgehog biactive fragment 
preferably is a soluble extracellular portion of a hedgehog protein, where solubility is with 
reference to physiologically compatible solutions. Exemplary bioactive fragments are described 
in PCT publications WO 95/18856 and WO 96/17924. 
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The term "ptc therapeutic" refers to agents which either (i) mimic the effect of hedgehog 
proteins on patched signalling, e.g., which antagonize the cell-cycle inhibitory and/or T-cell 
maturation promoting activity of patched, or (ii) activate or potentiate patched signalling. In 
other embodiments, the ptc therapeutic can be a hedgehog antagonist. The ptc therapeutic can 
5 be, e.g., a peptide, a nucleic acid, a carbohydrate, a small organic molecule, or natural product 
extract (or fraction thereof). 

An "effective amount" of, e.g., a hedgehog or ptc therapeutic, with respect to the subject 
method of treatment, refers to an amount of a composition, e.g., a hedgehog polypeptide, in a 
preparation which, when applied as part of a desired dosage regimen brings enhances or inhibits 
10 (as the case may be) T-cell maturation, relative to the absence of the hedgehog or ptc therapeutic, 
according to clinically acceptable standards for the disorder to be treated. 

A "patient" or "subject" to be treated by the subject method can mean either a human or 

p non-human animal. 

Cl The "growth state" of a cell refers to the rate of proliferation of the cell and the state of 

ru 15 differentiation of the cell. 

y3 "Homology" and "identity" each refer to sequence similarity between two polypeptide 

% sequences, with identity being a more strict comparison. Homology and identity can each be 

s determined by comparing a position in each sequence which may be aligned for purposes of 

fT comparison. When a position in the compared sequence is occupied by the same amino acid 

m 20 residue, then the polypeptides can be referred to as identical at that position; when the equivalent 

y3 site is occupied by the same amino acid (e.g., identical) or a similar amino acid (e.g., similar in 

Q 

g steric and/or electronic nature), then the molecules can be refered to as homologous at that 
position. A percentage of homology or identity between sequences is a function of the number of 
matching or homologous positions shared by the sequences. An "unrelated" or "non- 
25 homologous" sequence shares less than 40 percent identity, though preferably less than 25 
percent identity, with an hedgeog sequence of the present invention. 

The term "corresponds to", when referring to a particular polypeptide or nucleic acid 
sequence is meant to indicate that the sequence of interest is identical or homologous to the 
reference sequence to which it is said to correspond. 

30 The terms "recombinant protein", "heterologous protein" and "exogenous protein" are 

used interchangeably throughout the specification and refer to a polypeptide which is produced 
by recombinant DNA techniques, wherein generally, DNA encoding the polypeptide is inserted 
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into a suitable expression construct which is in turn used to transform a host cell to produce the 
heterologous protein. That is, the polypeptide is expressed from a heterologous nucleic acid. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid sequence 

encoding a hedgehog polypeptide with a second amino acid sequence defining a domain foreign 

5 to and not substantially homologous with any domain of hh protein. A chimeric protein may 

present a foreign domain which is found (albeit in a different protein) in an organism which also 

expresses the first protein, or it may be an "interspecies", "intergenic", etc. fusion of protein 

structures expressed by different kinds of organisms, hi general, a fusion protein can be 
represented by the general formula (X) n -(/i/i) m -(Y) n , wherein hh represents all or a portion of the 

10 hedgehog protein, X and Y each independently represent an amino acid sequences which are not 
naturally found as a polypeptide chain contiguous with the hedgehog sequence, m is an integer 
greater than or equal to 1, and each occurrence of n is, independently, 0 or an integer greater than 
_ or equal to 1 (n and m are preferably no greater than 5 or 10). 

~ The term "CD4+ lymphocyte" is intended to mean a lymphocyte having a plurality of cell 

pj 15 surface CD4 molecules as evidenced by binding of an antibody specific for CD4 to the cell 
4* surface, e.g., monoclonal antibody OKT4 (Ortho Diagnostics, Piscataway, N.J.). 

0] The term "CD8 + lymphocyte" is intended to mean a lymphocyte having a plurality of cell 

J" surface CD8 molecules as evidenced by binding of an antibody specific for CD8 to the cell 
H 8 surface, e.g., monoclonal antibody OKT8 (Ortho Diagnostics, Piscataway, N.J.). 

fU 20 "Peripheral blood leukocytes", abbreviated PBL, are intended to mean the cellular 

*M components of the immune system in blood, e.g., lymphocytes, monocytes, eosinophils, 
□ basophils, neutrophils* plasma cells, mast cell precursors, and the like. 

"Lymphocytes" are intended to encompass T-lymphocytes, (also referred to as T-cells), 
B-lymphocytes (also referred to as B-cells), natural killer-lymphocytes (NK-cells), cytotoxic T 
25 lymphocytes (CTL), T-helper lymphocytes, delta gamma -T-cell receptor bearing cells as well as 
precursors and activated derivatives thereof 

"Activated lymphocyte" is intended to mean that subset of lymphocytes which has been i) 
exposed to a stimulus, and ii) has been triggered to change from a metabolically-quiescent cell 
into a cell with increased protein synthesis and/or DNA and RNA synthesis and possibly cell 
30 division. Illustrative stimuli and triggering pathways leading to activated lymphocytes include 
interaction of T cell receptors with antigens, interaction of interleukin receptors with 
interleukins, interaction of growth factor receptors with growth factors, interaction of lymphocyte 
cell surface determinants with antibody (e.g., anti-CD2) or mitogens (e.g., PHA), and the like. 
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"Subject in need thereof 1 is intended to mean a mammal having one or more clinical or 
laboratory indicia of a disorder or disease. The subject may exhibit clinical disease activity or 
may have a subclinical or latent infection. Subjects in need thereof include human and non- 
human primates, mammals, domestic animals, livestock, and the like, e.g., dogs, cats, rodents, 
5 birds, horses, cows, pigs and fish. 

7/7. Exemplary Applications of Method and Compositions 

According to the invention, preparations with ptc and hedgehog therapeutics are used in 
immunoregulatory disorders and diseases in animals and humans, for the prevention or 
10 prophylaxis, control, diagnosis or treatment thereof. 

The subject method has wide applicability to the treatment or prophylaxis of disorders 
n affecting the regulation of lymphocytes, particularly maturation and/or activation of T 
\Q lymphocytes. In general, the method can be characterized as including a step of administering to 

'Hi 

Jz an animal an amount of a ptc or hedgehog therapeutic effective to alter the proliferative and/or 
J: 15 differentiation state of treated lymphocytes. Such therapeutic compositions may be useful in 

treatments designed to modulate, e.g., increase or decrease, an immunological response. Such 
jg diseases and conditions include, but are not limited to, infection (such as bacterial or viral 
* infection), metabolic disease such as diabetes, nutritional deficiency, toxic agents, graft rejection 

or other hyperacute response, or autoimmune disorders. The goals of treatment in each case can 
ry 20 be twofold: (1) to eliminate the cause of the disease or unwanted immunological response, and 
jjf (2) to relieve its symptoms. 

^ In view of their immunosuppressant activity, the hedgehog proteins (and agonists thereof) 

are suitable for preventing and treating diseases and conditions which require a temporary or 
permanent reduction or suppression of an immune response. In particular, their use extends to 

25 suppressing the activation of the proliferation of lymphocytes or cytotoxic T-cells and/or 
immunocytes, e.g. for preventing or treating autoimmune diseases such as diseases of the 
rheumatic type, multiple sclerosis, psoriasis, atopic dermatitis, or for preventing the rejection of 
transplanted tissues or organs such as kidneys, heart, lungs, bone marrow, spleen, skin or cornea, 
in undesirable reactions during or after transfusions, allergic diseases, particularly those which 

30 affect the gastrointestinal tract and which may take the form of an inflammation, or 
inflammatory, proliferative and hyperproliferative diseases and cutaneous manifestations of 
immunological disorders such as eczematous dermatitis, urticaria, vasculitis and scleroderma. 
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Thus, it is particularly advantageous to use immunosuppressive forms of the subject 
hedgehog nd ptc therapeutics clinically for the disorders, diseases and conditions described 
above, i.e. when it is desirable to achieve immunosuppression in an animal or human body. 

Depending on the nature and cause of the disease or disorder to be treated or the 
5 condition which is to be influenced in an animal or human body, it may be desirable to 
administer the hedgehog or ptc therapeutic preparation systemically, locally or topically to the 
tissue or organ in question. Systemic action is desirable, for example, when various organs or 
organ systems are in need of treatment, as is the case for example in systemic autoimmune 
diseases or allergies or in transplants of large, foreign organs or tissues. By contrast, a local effect 
10 would be considered if only local manifestations of an immunological occurrence had to be 
treated, e.g. in small transplants of skin or cornea or in cases of local dermatitis. 

Depending on the duration and intensity of the immunosuppressant activity required, the 

□ hedgehog or ptc therapeutic preparations may be given one or more times a day, as well as 
H intermittently, over a period of several days, weeks or months and in various dosages. 

iij " 

15 In still other embodiments, antagonist (lymphocyte maturation promoting) forms of the 
tO subject ptc and hedgehog therapeutics can be used to treat disorders involving hypoimmunity, 
4: e.g., immunosuppressed or immunocompromised patients. For instance, the effect of hedgehog- 
s' induced immunosuppression, e.g., resulting from endogenous or heterologous activation of the 
H 8 hedgehog signalling pathway, can be counteracted using antagonist forms of the subject ptc and 
ni 20 hedgehog therapeutics. 

t'V's 

S Thus, the subject method contemplates the treatment of immunocompromised subjects to 

□ increase one or more indicia of cell mediated immunity (CMI), humoral immunity, or innate 
resistance to infection, by administering pharmaceutical preparations of an activating ptc or 
hedgehog therapeutic. In certain embodiments, such immunity-promoting activities of 

25 antagonists forms of the subject ptc and hedgehog therapeutics can be identified, e.g., by i) 
increased E-rosette forming cells (E-RFC) in thymocyte cultures after incubation with the subject 
ptc or hedgehog therapeutic agents; ii) increased E-RFC in cultures of thymocytes from aged 
animals after incubation with the subject ptc or hedgehog therapeutic agents; and, iii) increased 
expression of OKT 4< + > in cultures of human peripheral blood T-lymphocytes from patients 

30 with secondary immunodeficiency syndromes following treatment with the subject ptc or 
hedgehog therapeutic agents. Increased expression of CD2 and CD4 accessory molecules on T- 
lymphocytes is compatible with a heighten the state of innate or induced immunity to infection, 
e.g., by upregulating T-helper and cytotoxic T-lymphocytes to respond to lower levels of antigen. 
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Immunodeficiency states fall into three general etiologic categories. First, there is 
immunosuppression that occurs as a consequence of disease processes. Second, there are 
immunodeficiencies that arise because of therapy for other diseases, so-called iatrogenic 
immunodeficiencies. Third, immunodeficiencies may result from direct attack of T-lymphocytes 
5 by the human immunodeficiency virus (HIV) that causes the acquired immunodeficiency 
syndrome (AIDS). 

Common disease processes that lead to immunodeficiency are malnutrition, neoplasias, 
aging, and infections. Malnourished people, patients with advanced widespread cancers and 
people with debilitating illnesses become sick and die more often because impaired cell- 
10 mediated and humoral immune responses increase susceptibility to infections by a variety of 
organisms. A state of generalized deficiency in immune responses is called anergy. Various types 
of infections, especially viral infections, lead to immunosuppression. A drug, such as a 
q maturation-pormoting form of a Ptc or hedgehog therapeutic, e.g., capable of making the T- 
C; helper lymphocyte components of the immune system more robust, will be an important 
r{jj 15 therapeutic agent for increasing the resistance of the patient to infections. For example, Ptc or 
£ hedgehog therapeutic or its analogs, may be: 

01 administered to patients, especially older patients, before or just after admissions to 
^ hospitals in order to reduce the risks of nosocomial (hospital-induced) infections, a common and 
u severe clinical problem 

J— JL 

hj 20 administered to burn victims, because such individuals are especially prone to infections 

2 . administered to patients in anticipation of epidemic infections, for example, in 

'.sew 

Q conjunction with influenza vaccinations or hepatitis vaccinations, to invigorate the immune 
response to pathogens 

administered to patients with asymptomatic viral infections, in order to enhance immune 
25 surveillance of pathogenic organisms and reduce the likelihood of recurrence of disease, for 
example, for individuals who are carriers of herpes viruses, varicella viruses, hepatitis viruses 
and HIV. 

Iatrogenic immunosuppression is most often due to drug therapies which either kill or 
functionally inactivate lymphocytes. Various chemotherapeutic drugs are administered to cancer 
30 patients, and these drugs are usually cytotoxic to both mature and developing lymphocytes as 
well as to granulocyte and monocyte precursors. Thus, cancer chemotherapy is almost always 
accompanied by a period of immunosuppression and increased risk of infections. Radiation 
treatment of cancer carries the same risks. Medications (granulocyte-colony stimulating factor) 
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exist for increasing neutrophils in blood to combat infections that occur after cancer 
chemotherapy, but no medications are currently used for restoring lymphocytic functions. Major 
surgery, for example repair of aneurysms or by-pass operations, also decrease immune function 
in humans. The reasons for the decline in blood lymphocytes that occur because of major surgery 
are not clear, but an agent that elevates lymphocyte functions in such patients have therapeutic 
value in decreasing the likelihood of infections. 

One final form of acquired immunosuppression that should be mentioned results from the 
absence of a spleen, caused by surgical removal of the organ after trauma or for the treatment of 
certain hematologic diseases or as a result of infarction in sickle cell disease. Patients without 
spleens are more susceptible to infections by some organisms, particularly encapsulated bacteria 
such as Streptococcus pneumoniae. The spleen is apparently required for the induction of 
protective humoral immune responses to such organisms. The subject Ptc or hedgehog 
therapeutics can help individuals without a spleen or without a thymus in resistance against 
infection by micro-organisms. 

IK Exemplary hedgehog therapeutic compounds. 

The hedgehog therapeutic compositions of the subject method can be generated by any of 
a variety of techniques, including purification of naturally occurring proteins, recombinantly 
produced proteins and synthetic chemistry. Polypeptide forms of the hedgehog therapeutics are 
preferably derived from vertebrate hedgehog proteins, e.g., have sequences corresponding to 
naturally occurring hedgehog proteins, or fragments thereof, from vertebrate organisms. 
However, it will be appreciated that the hedgehog polypeptide can correspond to a hedgehog 
protein (or fragment thereof) which occurs in any metazoan organism. 

The various naturally-occurring hedgehog proteins from which the subject therapeutics 
can be derived are characterized by a signal peptide, a highly conserved N-terminal region, and a 
more divergent C-terminal domain. In addition to signal sequence cleavage in the secretory 
pathway (Lee, JJ. et al (1992) Cell 71:33-50; Tabata, T. et al (1992) Genes Dev. 2635-2645; 
Chang, D.E. et al (1994) Development 120:3339-3353), hedgehog precursor proteins naturally 
undergo an internal autoproteolytic cleavage which depends on conserved sequences in the C- 
terminal portion (Lee et al (1994) Science 266:1528-1537; Porter et al (1995) Nature 374:363- 
366). This autocleavage leads to a 19 kD N-terminal peptide and a C-terminal peptide of 26-28 
kD (Lee et al (1992) supra; Tabata et al (1992) supra; Chang et al (1994) supra; Lee et al 
(1994) supra; Bumcrot, D.A., et al (1995) Mol Cell Biol 15:2294-2303; Porter et al (1995) 
supra; Ekker, S.C. et al (1995) Curr. Biol 5:944-955; Lai, C.J. et al (1995) Development 
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121:2349-2360). The N-terminal peptide stays tightly associated with the surface of cells in 
which it was synthesized, while the C-terminal peptide is freely diffusible both in vitro and in 
vivo (Lee et al (1994) supra; Bumcrot et al (1995) supra; Mart', E. et al (1995) Development 
121:2537-2547; Roelink, H. et al (1995) Cell 81:445-455). Cell surface retention of the N- 
5 terminal peptide is dependent on autocleavage, as a truncated form of hedgehog encoded by an 
RNA which terminates precisely at the normal position of internal cleavage is diffusible in vitro 
(Porter et al (1995) supra) and in vivo (Porter, J.A. et al (1996) Cell 86, 21-34). Biochemical 
studies have shown that the autoproteolytic cleavage of the hedgehog precursor protein proceeds 
through an internal thioester intermediate which subsequently is cleaved in a nucleophilic 
10 substitution. It is suggested that the nucleophile is a small lipophilic molecule, more particularly 
cholesterol, which becomes covalently bound to the C-terminal end of the N-peptide (Porter et 
al (1996) supra\ tethering it to the cell surface. 

The vertebrate family of hedgehog genes includes at least four members, e.g., paralogs of 
the single drosophila hedgehog gene (SEQ ID No. 19). Three of these members, herein referred 
ft] 15 to as Desert hedgehog (Dhh), Sonic hedgehog (Shh) and Indian hedgehog (Ihh), apparently exist 
*F in all vertebrates, including fish, birds, and mammals. A fourth member, herein referred to as 
tiggie-winkle hedgehog (Thh), appears specific to fish. According to the appended sequence 
listing, (see also Table 1) a chicken Shh polypeptide is encoded by SEQ ID No:l; a mouse Dhh 
polypeptide is encoded by SEQ ID No:2; a mouse Ihh polypeptide is encoded by SEQ ID No:3; a 
20 mouse Shh polypeptide is encoded by SEQ ID No:4 a zebrafish Shh polypeptide is encoded by 
SEQ ID No:5; a human Shh polypeptide is encoded by SEQ ID No:6; a human Ihh polypeptide is 
encoded by SEQ ID No:7; a human Dhh polypeptide is encoded by SEQ ID No. 8; and a 
zebrafish Thh is encoded by SEQ ID No. 9. 

25 Table 1 

Guide to hedgehog sequences in Sequence Listing 
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In addition to the sequence variation between the various hedgehog homologs, the 
hedgehog proteins are apparently present naturally in a number of different forms, including a 
pro-form, a full-length mature form, and several processed fragments thereof. The pro-form 
includes an N-terminal signal peptide for directed secretion of the extracellular domain, while the 
full-length mature form lacks this signal sequence. 

As described above, further processing of the mature form occurs in some instances to 
yield biologically active fragments of the protein. For instance, sonic hedgehog undergoes 
additional proteolytic processing to yield two peptides of approximately 19 kDa and 27 kDa, the 
19kDa fragment corresponding to an proteolytic N-terminal portion of the mature protein. 

In addition to proteolytic fragmentation, the vertebrate hedgehog proteins can also be 
modified post-translationally, such as by glycosylation and/or addition of lipophilic moieties, 
such as stents, fatty acids, etc., though bacterially produced (e.g. unmodified) forms of the 
proteins still maintain certain of the bioactivities of the native protein. Bioactive fragments of 
hedgehog polypeptides of the present invention have been generated and are described in great 
detail in, e.g., PCT publications WO 95/18856 and WO 96/17924. 

There are a wide range of lipophilic moieties with which hedgehog polypeptides can be 
derivatived. The term "lipophilic group", in the context of being attached to a hedgehog 
polypeptide, refers to a group having high hydrocarbon content thereby giving the group high 
affinity to lipid phases. A lipophilic group can be, for example, a relatively long chain alkyl or 
cycloalkyl (preferably n-alkyl) group having approximately 7 to 30 carbons. The alkyl group may 
terminate with a hydroxy or primary amine "tail". To further illustrate, lipophilic molecules 
include naturally-occurring and synthetic aromatic and non-aromatic moieties such as fatty acids, 
sterols, esters and alcohols, other lipid molecules, cage structures such as adamantane and 
buckminsterfullerenes, and aromatic hydrocarbons such as benzene, perylene, phenanthrene, 
anthracene, naphthalene, pyrene, chrysene, and naphthacene. 

In one embodiment, the hedgehog polypeptide is modified with one or more sterol 
moieties, such as cholesterol. See, for example, PCT publication WO 96/17924. In certain 
embodiments, the cholesterol is preferably added to the C-terminal glycine were the hedgehog 
polypeptide corresponds to the naturally-occurring N-terminal proteolytic fragment. 

In another embodiment, the hedgehog polypeptide can be modified with a fatty acid 
moiety, such as a myrostoyl, palmitoyl, stearoyl, or arachidoyl moiety. See, e.g., Pepinsky et al. 
(1998) J Biol. Chem 273: 14037. 
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In addition to those effects seen by cholesterol-addition to the C-terminus or fatty acid 
addition to the N-terminus of extracellular fragments of the protein, at least certain of the 
biological activities of the hedgehog gene products are unexpectedly potentiated by 
derivativation of the protein with lipophilic moieties at other sites on the protein and/or by 
moieties other than cholesterol or fatty acids. Certain aspects of the invention are directed to the 
use of preparations of hedgehog polypeptides which are modified at sites other than N-terminal 
or C-terminal residues of the natural processed form of the protein, and/or which are modified at 
such terminal residues with lipophilic moieties other than a sterol at the C-terminus or fatty acid 
at the N-terminus. 

Particularly useful as lipophilic molecules are alicyclic hydrocarbons, saturated and 
unsaturated fatty acids and other lipid and phospholipid moieties, waxes, cholesterol, 
isoprenoids, terpenes and polyalicyclic hydrocarbons including adamantane and 
buckminsterfullerenes, vitamins, polyethylene glycol or oligoethylene glycol, (Cl-C18)-alkyl 
phosphate diesters, -0-CH2-CH(OH)-0-(C12-C18)-alkyl, and in particular conjugates with 
pyrene derivatives. The lipophilic moiety can be a lipophilic dye suitable for use in the invention 
include, but are not limited to, diphenylhexatriene, Nile Red, N-phenyl-l-naphthylamine, Prodan, 
Laurodan, Pyrene, Perylene, rhodamine, rhodamine B, tetramethylrhodamine, Texas Red, 
sulforhodamine, 1 , 1 '-didodecyl-S^^'^'tetramethylindocarbocyanine perchlorate, octadecyl 
rhodamine B and the BODIPY dyes available from Molecular Probes Inc. 

Other exemplary lipophilic moietites include aliphatic carbonyl radical groups include 1- 
or 2-adamantylacetyl, 3-methyladamant-l-ylacetyl, 3-methyl-3-bromo-l-adamantylacetyl, 1- 
decalinacetyl, camphoracetyl, camphaneacetyl, noradamantylacetyl, norbornaneacetyl, 
bicyclo[2.2.2.]-oct-5-eneacetyl, l-methoxybicyclo[2.2.2.]-oct-5-ene-2-carbonyl, cis-5- 
norbornene-endo-2,3-dicarbonyl, 5-norbornen-2-ylacetyl, (lR)-( - )-myrtentaneacetyl, 2- 
norbornaneacetyl, anti-3-oxo-tricyclo[2.2.1.0<2,6> ]-heptane-7-carbonyl, decanoyl, dodecanoyl, 
dodecenoyl, tetradecadienoyl, decynoyl or dodecynoyl. 

The hedgehog polypeptide can be linked to the hydrophobic moiety in a number of ways 
including by chemical coupling means, or by genetic engineering. 

There are a large number of chemical cross-linking agents that are known to those skilled 
in the art. For the present invention, the preferred cross-linking agents are heterobifunctional 
cross-linkers, which can be used to link the hedgehog polypeptide and hydrophobic moiety in a 
stepwise manner. Heterobifunctional cross-linkers provide the ability to design more specific 
coupling methods for conjugating to proteins, thereby reducing the occurrences of unwanted side 
reactions such as homo-protein polymers. A wide variety of heterobifunctional cross-linkers are 
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known in the art. These include: succinimidyl 4-(N-maleimidomethyl) cyclohexane- 1- 
carboxylate (SMCC), m-Maleimidobenzoyl-N- hydroxysuccinimide ester (MBS); N- 
succinimidyl (4-iodoacetyl) aminobenzoate (SIAB), succinimidyl 4-(p-maleimidophenyl) 
butyrate (SMPB), l-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride (EDC); 4- 
5 succinimidyloxycarbonyl- a-methyl-a-(2-pyridyldithio)-tolune (SMPT), N-succinimidyl 3-(2- 
pyridyldithio) propionate (SPDP), succinimidyl 6-[3-(2-pyridyldithio) propionate] hexanoate 
(LC-SPDP). Those cross-linking agents having N-hydroxysuccinimide moieties can be obtained 
as the N-hydroxysulfosuccinimide analogs, which generally have greater water solubility. In 
addition, those cross-linking agents having disulfide bridges within the linking chain can be 
1 0 synthesized instead as the alkyl derivatives so as to reduce the amount of linker cleavage in vivo. 

In addition to the heterobifunctional cross-linkers, there exists a number of other cross- 
linking agents including homobifunctional and photoreactive cross-linkers. Disuccinimidyl 
Q suberate (DSS), bismaleimidohexane (BMH) and dimethylpimelimidate-2 HC1 (DMP) are 
y3 examples of useful homobifunctional cross-linking agents, and bis-[B-(4- 
»S 15 azidosalicylamido)ethyl]disulfide (BASED) and N-succinimidyl-6(4-azido-2-nitrophenyl- 
£ amino)hexanoate (SANPAH) are examples of useful photoreactive cross-linkers for use in this 
i: invention. For a recent review of protein coupling techniques, see Means et al. (1990) 
if; Bioconjugate Chemistry 1 :2-12, incorporated by reference herein. 

EI 

u. One particularly useful class of heterobifunctional cross-linkers, included above, contain 

M; 20 the primary amine reactive group, N-hydroxysuccinimide (NHS), or its water soluble analog N- 

g hydroxysulfosuccinimide (sulfo-NHS). Primary amines (lysine epsilon groups) at alkaline pH's 

□ are unprotonated and react by nucleophilic attack on NHS or sulfo-NHS esters. This reaction 

u results in the formation of an amide bond, and release of NHS or sulfo-NHS as a by-product. 

Another reactive group useful as part of a heterobifunctional cross-linker is a thiol 
25 reactive group. Common thiol reactive groups include maleimides, halogens, and pyridyl 
disulfides. Maleimides react specifically with free sulfhydryls (cysteine residues) in minutes, 
under slightly acidic to neutral (pH 6.5-7.5) conditions. Halogens (iodoacetyl functions) react 
with -SH groups at physiological pH's. Both of these reactive groups result in the formation of 
stable thioether bonds. 

30 The third component of the heterobifunctional cross-linker is the spacer arm or bridge. 

The bridge is the structure that connects the two reactive ends. The most apparent attribute of the 
bridge is its effect on steric hindrance. In some instances, a longer bridge can more easily span 
the distance necessary to link two complex biomolecules. For instance, SMPB has a span of 14.5 
angstroms. 
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Preparing protein-protein conjugates using heterobifunctional reagents is a two-step 
process involving the amine reaction and the sulfhydryl reaction. For the first step, the amine 
reaction, the protein chosen should contain a primary amine. This can be lysine epsilon amines 
or a primary alpha amine found at the N-terminus of most proteins. The protein should not 
5 contain free sulfhydryl groups. In cases where both proteins to be conjugated contain free 
sulfhydryl groups, one protein can be modified so that all sulfhydryls are blocked using for 
instance, N-ethylmaleimide (see Partis et al. (1983) J. Pro. Chem. 2:263, incorporated by 
reference herein). Ellman's Reagent can be used to calculate the quantity of sulfhydryls in a 
particular protein (see for example Ellman et al. (1958) Arch. Biochem. Biophys. 74:443 and 
10 Riddles et al. (1979) Anal. Biochem. 94:75, incorporated by reference herein). 

The reaction buffer should be free of extraneous amines and sulfhydryls. The pH of the 
reaction buffer should be 7.0-7.5. This pH range prevents maleimide groups from reacting with 
amines, preserving the maleimide group for the second reaction with sulfhydryls. 

The NHS-ester containing cross-linkers have limited water solubility. They should be 
ry 15 dissolved in a minimal amount of organic solvent (DMF or DMSO) before introducing the cross- 
ly linker into the reaction mixture. The cross-linker/solvent forms an emulsion which will allow the 
m reaction to occur. 

g The sulfo-NHS ester analogs are more water soluble, and can be added directly to the 

H reaction buffer. Buffers of high ionic strength should be avoided, as they have a tendency to "salt 
pj 20 out" the sulfo-NHS esters. To avoid loss of reactivity due to hydrolysis, the cross-linker is added 
to the reaction mixture immediately after dissolving the protein solution. 

The reactions can be more efficient in concentrated protein solutions. The more alkaline 
the pH of the reaction mixture, the faster the rate of reaction. The rate of hydrolysis of the NHS 
and sulfo-NHS esters will also increase with increasing pH. Higher temperatures will increase 
25 the reaction rates for both hydrolysis and acylation. 

Once the reaction is completed, the first protein is now activated, with a sulfhydryl 
reactive moiety. The activated protein may be isolated from the reaction mixture by simple gel 
filtration or dialysis. To carry out the second step of the cross-linking, the sulfhydryl reaction, 
the lipophilic group chosen for reaction with maleimides, activated halogens, or pyridyl 
30 disulfides must contain a free sulfhydryl. Alternatively, a primary amine may be modified with 
to add a sulfhydryl 
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In all cases, the buffer should be degassed to prevent oxidation of sulfhydryl groups. 
EDTA may be added to chelate any oxidizing metals that may be present in the buffer. Buffers 
should be free of any sulfhydryl containing compounds. 

Maleimides react specifically with -SH groups at slightly acidic to neutral pH ranges (6.5- 
5 7.5). A neutral pH is sufficient for reactions involving halogens and pyridyl disulfides. Under 
these conditions, maleimides generally react with -SH groups within a matter of minutes. Longer 
reaction times are required for halogens and pyridyl disulfides. 

The first sulfhydryl reactive-protein prepared in the amine reaction step is mixed with the 
sulfhydryl-containing lipophilic group under the appropriate buffer conditions. The conjugates 
10 can be isolated from the reaction mixture by methods such as gel filtration or by dialysis. 

Exemplary activated lipophilic moieties for conjugation include: N-(l-pyrene)maleimide; 
2,5-dimethoxystilbene-4 f -maleimide, eosin-5-maleimide; fluorescein-5-maleimide; N-(4-(6- 
j3 dimethylamino- 2-benzofuranyl)phenyl)maleimide; benzophenone-4-maleimide; 4- 
N dimethylaminophenylazophenyl- 4 f -maleimide (DABMI), tetramethylrhodamine-5-maleimide, 
15 tetramethylrhodamine-6-maleimide, Rhodamine RedTM . C2 maleimide, N-(5- 
aminopentyl)maleimide, trifluoroacetic acid salt, N-(2-aminoethyl)maleimide, trifluoroacetic 
J acid salt, Oregon GreenTM 488 maleimide, N-(2-((2-(((4-azido- 2,3,5,6-tetrafluoro)benzoyl) 
amino)ethyl)dithio)ethyl)maleimide (TFPAM-SS1), 2-(l-(3-dimethylaminopropyl) -indol-3-yl)- 
g 3-(indol-3-yl) maleimide (bisindolylmaleimide; GF 109203X), BODIPY® FL N-(2- 
m 20 aminoethyl)maleimide, N-(7-dimethylamino- 4-methylcoumarin-3-yl)maleimide (DACM), 
By AlexaTM 488 C5 maleimide, AlexaTM 594 C5 maleimide, sodium saltN-(l-pyrene)maleimide, 
j=? 2,5-dimethoxystilbene-4 , -maleimide, eosin-5-maleimide, fluorescein-5 -maleimide, N-(4-(6- 
dimethylamino- 2-benzofuranyl)phenyl)maleimide, benzophenone-4-maleimide, 4- 
dimethylaminophenylazophenyl- 4'-maleimide, l-(2-maleimidylethyl)-4-(5- (4- 
25 methoxyphenyl)oxazol-2- yl)pyridinium methanesulfonate, tetramethylrhodamine-5-maleimide, 
tetramethylrhodamine-6-maleimide, Rhodamine RedTM C2 maleimide, N-(5- 
aminopentyl)maleimide, N-(2-aminoethyl)maleimide, N-(2-((2-(((4-azido- 2,3,5,6- 
tetrafluoro)benzoyl) amino)ethyl)dithio)ethyl)maleimide, 2-(l-(3-dimethylaminopropyl) -indol-3- 
yl)-3-(indol-3-yl) maleimide, N-(7-dimethylamino- 4-methylcoumarin-3-yl)maleimide (DACM), 
30 1 lH-Benzo[a]fluorene, Benzo[a]pyrene. 

In one embodiment, the hedgehog polypeptide can be derivatived using pyrene 
maleimide, which can be purchased from Molecular Probes (Eugene, Oreg.), e.g., N-(l- 
pyrene)maleimide or 1-pyrenemethyl iodoacetate (PMIA ester). 
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For those embodiments wherein the hydophobic moiety is a polypeptide, the modified 
hedgehog polypeptide of this invention can be constructed as a fusion protein, containing the 
hedgehog polypeptide and the hydrophobic moiety as one contiguous polypeptide chain. 

In certain embodiments, the lipophilic moiety is an amphipathic polypeptide, such as 
magainin, cecropin, attacin, melittin, gramicidin S, alpha-toxin of Staph, aureus, alamethicin or a 
synthetic amphipathic polypeptide. Fusogenic coat proteins from viral particles can also be a 
convenient source of amphipathic sequences for the subject hedgehog proteins 

Moreover, mutagenesis can be used to create modified hh polypeptides, e.g., for such 
purposes as enhancing therapeutic or prophylactic efficacy, or stability (e.g., ex vivo shelf life and 
resistance to proteolytic degradation in vivo). Such modified peptides can be produced, for 
instance, by amino acid substitution, deletion, or addition. Modified hedgehog polypeptides can 
also include those with altered post-translational processing relative to a naturally occurring 
hedgehog protein, e.g., altered glycosylation, cholesterolization, prenylation and the like. 

In one embodiment, the hedgehog therapeutic is a polypeptide encodable by a nucleotide 
sequence that hybridizes under stringent conditions to a hedgehog coding sequence represented 
in one or more of SEQ ID Nos:l-7. Appropriate stringency conditions which promote DNA 
hybridization, for example, 6.0 x sodium chloride/sodium citrate (SSC) at about 45°C, followed 
by a wash of 2.0 x SSC at 50°C, are known to those skilled in the art or can be found in Current 
Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the. 
salt concentration in the wash step can be selected from a low stringency of about 2.0 x SSC at 
50°C to a high stringency of about 0.2 x SSC at 50°C. In addition, the temperature in the wash 
step can be increased from low stringency conditions at room temperature, about 22°C, to high 
stringency conditions at about 65°C. 

As described in the literature, genes for other hedgehog proteins, e.g., from other animals, 
can be obtained from mRNA or genomic DNA samples using techniques well known in the art. 
For example, a cDNA encoding a hedgehog protein can be obtained by isolating total mRNA 
from a cell, e.g. a mammalian cell, e.g. a human cell, including embryonic cells. Double 
stranded cDNAs can then be prepared from the total mRNA, and subsequently inserted into a 
suitable plasmid or bacteriophage vector using any one of a number of known techniques. The 
gene encoding a hedgehog protein can also be cloned using established polymerase chain 
reaction techniques. 

Preferred nucleic acids encode a hedgehog polypeptide comprising an amino acid 
sequence at least 60% homologous or identical, more preferably 70% homologous or identical, 
and most preferably 80% homologous or identical with an amino acid sequence selected from the 
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group consisting of SEQ ID Nos:8-14. Nucleic acids which encode polypeptides at least about 
90%, more preferably at least about 95%, and most preferably at least about 98-99% homology 
or identity with an amino acid sequence represented in one of SEQ ID Nos:8-14 are also within 
the scope of the invention. 

5 In addition to native hedgehog proteins, hedgehog polypeptides preferred by the present 

invention are at least 60% homologous or identical, more preferably 70% homologous or 
identical and most preferably 80% homologous or identical with an amino acid sequence 
represented by any of SEQ ID Nos:8-14. Polypeptides which are at least 90%, more preferably at 
least 95%, and most preferably at least about 98-99% homologous or identical with a sequence 
10 selected from the group consisting of SEQ ID Nos:8-14 are also within the scope of the 
invention. The only prerequisite is that the hedgehog polypeptide is capable of modulating the 
growth state of T lymphocytes. 

□ The term "recombinant protein" refers to a polypeptide of the present invention which is 

r; produced by recombinant DNA techniques, wherein generally, DNA encoding a hedgehog 

fU 15 polypeptide is inserted into a suitable expression vector which is in turn used to transform a host 

'% cell to produce the heterologous protein. Moreover, the phrase "derived from", with respect to a 

jjl recombinant hedgehog gene, is meant to include within the meaning of "recombinant protein" 
those proteins having an amino acid sequence of a native hedgehog protein, or an amino acid 

y, sequence similar thereto which is generated by mutations including substitutions and deletions 

N» 20 (including truncation) of a naturally occurring form of the protein. 

ffl The method of the present invention can also be carried out using variant forms of the 

;war. 

;r: naturally occurring hedgehog polypeptides, e.g., mutational variants. 

As is known in the art, hedgehog polypeptides can be produced by standard biological 
techniques or by chemical synthesis. For example, a host cell transfected with a nucleic acid 

25 vector directing expression of a nucleotide sequence encoding the subject polypeptides can be 
cultured under appropriate conditions to allow expression of the peptide to occur. The 
polypeptide hedgehog may be secreted and isolated from a mixture of cells and medium 
containing the recombinant hedgehog polypeptide. Alternatively, the peptide may be retained 
cytoplasmically by removing the signal peptide sequence from the recombinant hedgehog gene 

30 and the cells harvested, lysed and the protein isolated. A cell culture includes host cells, media 
and other byproducts. Suitable media for cell culture are well known in the art. The recombinant 
hedgehog polypeptide can be isolated from cell culture medium, host cells, or both using 
techniques known in the art for purifying proteins including ion-exchange chromatography, gel 
filtration chromatography, ultrafiltration, electrophoresis, and immunoaffinity purification with 



-20- 



ONV-080.01 



antibodies specific for such peptide. In a preferred embodiment, the recombinant hedgehog 
polypeptide is a fusion protein containing a domain which facilitates its purification, such as an 
hedgehog/GST fusion protein. The host cell may be any prokaryotic or eukaryotic cell. 

Recombinant hedgehog genes can be produced by ligating nucleic acid encoding an 
5 hedgehog protein, or a portion thereof, into a vector suitable for expression in either prokaryotic 
cells, eukaryotic cells, or both. Expression vectors for production of recombinant forms of the 
subject hedgehog polypeptides include plasmids and other vectors. For instance, suitable vectors 
for the expression of a hedgehog polypeptide include plasmids of the types: pBR322-derived 
plasmids, pEMBL-derived plasmids, pEX-derived plasmids, pBTac-derived plasmids and pUC- 
10 derived plasmids for expression in prokaryotic cells, such as E. coli. 

A number of vectors exist for the expression of recombinant proteins in yeast. For 
instance, YEP24, YIPS, YEP51, YEP52, pYES2, and YRP17 are cloning and expression vehicles 
u useful in the introduction of genetic constructs into 5. cerevisiae (see, for example, Broach et ah 
r§ (1983) in Experimental Manipulation of Gene Expression, ed. M. Inouye Academic Press, p. 83, 
flj 15 incorporated by reference herein). These vectors can replicate in E. coli due to the presence of 
tl the pBR322 ori, and in S. cerevisiae due to the replication determinant of the yeast 2 micron 
m plasmid. In addition, drug resistance markers such as ampicillin can be used. In an illustrative 
embodiment, an hedgehog polypeptide is produced recombinantly utilizing an expression vector 
Mi generated by sub-cloning the coding sequence of one of the hedgehog genes represented in SEQ 
M20 IDNos:l-7. 

fU 

83 The preferred mammalian expression vectors contain both prokaryotic sequences, to 

rf facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription units 
that are expressed in eukaryotic cells. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, 
pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived vectors are 
25 examples of mammalian expression vectors suitable for transfection of eukaryotic cells. Some of 
these vectors are modified with sequences from bacterial plasmids, such as pBR322, to facilitate 
replication and drug resistance selection in both prokaryotic and eukaryotic cells. Alternatively, 
derivatives of viruses such as the bovine papillomavirus (BPV-1), or Epstein-Barr virus (pHEBo, 
pREP-derived and p205) can be used for transient expression of proteins in eukaryotic cells. The 
30 various methods employed in the preparation of the plasmids and transformation of host 
organisms are well known in the art. For other suitable expression systems for both prokaryotic 
and eukaryotic cells, as well as general recombinant procedures, see Molecular Cloning A 
Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory Press: 1989) Chapters 16 and 17. 
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In some instances, it may be desirable to express the recombinant hedgehog polypeptide 
by the use of a baculovirus expression system. Examples of such baculovirus expression systems 
include pVL-derived vectors (such as pVL1392, pVL1393 and pVL941), pAcUW-derived 
vectors (such as pAcUWl), and pBlueBac-derived vectors (such as the B-gal containing 
5 pBlueBac m). 

When it is desirable to express only a portion of an hedgehog protein, such as a form 
lacking a portion of the N-terminus, i.e. a truncation mutant which lacks the signal peptide, it may 
be necessary to add a start codon (ATG) to the oligonucleotide fragment containing the desired 
sequence to be expressed. It is well known in the art that a methionine at the N-terminal position 
10 can be enzymatically cleaved by the use of the enzyme methionine aminopeptidase (MAP). MAP 
has been cloned from E. coli (Ben-Bassat et al. (1987) J. Bacteriol 169:751-757) and Salmonella 
typhimurium and its in vitro activity has been demonstrated on recombinant proteins (Miller et al. 
f*\ (1987) PNAS £4:2718-1722). Therefore, removal of an N-terminal methionine, if desired, can be 
jfi achieved either in vivo by expressing hedgehog-derived polypeptides in a host which produces 
ey 15 MAP (e.g., E. coli or CM89 or S. cerevisiae), or in vitro by use of purified MAP (e.g., procedure 
45 of Miller et al., supra). 

gfj Alternatively, the coding sequences for the polypeptide can be incorporated as a part of a 

fusion gene including a nucleotide sequence encoding a different polypeptide. It is widely 
Li, appreciated that fusion proteins can also facilitate the expression of proteins, and accordingly, can 
M- 20 be used in the expression of the hedgehog polypeptides of the present invention. For example, 
j^J hedgehog polypeptides can be generated as glutathione-S-transferase (GST-fusion) proteins. 
O Such GST-fusion proteins can enable easy purification of the hedgehog polypeptide, as for 
u example by the use of glutathione-derivatized matrices (see, for example, Current Protocols in 
Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley & Sons, 1991)). In another 
25 embodiment, a fusion gene coding for a purification leader sequence, such as a poly- 
(His)/enterokinase cleavage site sequence, can be used to replace the signal sequence which 
naturally occurs at the N-terminus of the hedgehog protein (e.g.of the pro-form, in order to permit 
purification of the poly(His)-hedgehog protein by affinity chromatography using a Ni 2+ metal 
resin. The purification leader sequence can then be subsequently removed by treatment with 
30 enterokinase (e.g., see Hochuli et al. (1987) J. Chromatography 411:177; and Janknecht et al. 
PNAS 88:8972). 

Techniques for making fusion genes are known to those skilled in the art. Essentially, the 
joining of various DNA fragments coding for different polypeptide sequences is performed in 
accordance with conventional techniques, employing blunt-ended or stagger-ended termini for 
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ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive 
ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional techniques 
including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can 
5 be carried out using anchor primers which give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be annealed to generate a chimeric gene 
sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel et al. John 
Wiley & Sons: 1992). 

Hedgehog polypeptides may also be chemically modified to create hedgehog derivatives 
10 by forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
groups* cholesterol, isoprenoids, lipids, phosphate, acetyl groups and the like. Covalent 
derivatives of hedgehog proteins can be prepared by linking the chemical moieties to functional 
i-h groups on amino acid sidechains of the protein or at the N-terminus or at the C-terminus of the 

:t=sr 

J3 polypeptide. 

;rdl5 For instance, hedgehog proteins can be generated to include a moiety, other than 

% sequence naturally associated with the protein, that binds a component of the extracellular matrix 

gi and enhances localization of the analog to cell surfaces. For example, sequences derived from 

^ the fibronectin "type-Hi repeat", such as a tetrapeptide sequence R-G-D-S (Pierschbacher et al. 

*y, (1984) Nature 309:30-3; and Kornblihtt et al. (1985) EMBO 4:1755-9) can be added to the 

K20 hedgehog polypeptide to support attachment of the chimeric molecule to a cell through binding 

^ ECM components (Ruoslahti et al. (1987) Science 238:491-497; Pierschbacheret al. (1987) J. 

5 Biol Chem. 262:17294-8.; Hynes (1987) Cell 48:549-54; and Hynes (1992) Cell 69:1 1-25). 

~=-: 

In a preferred embodiment, the hedgehog polypeptide is isolated from, or is otherwise 
substantially free of, other cellular proteins, especially other extracellular or cell surface 

25 associated proteins which may normally be associated with the hedgehog polypeptide, unless 
provided in the form of fusion protein with the hedgehog polypeptide. The term "substantially 
free of other cellular or extracellular proteins" (also referred to herein as "contaminating 
proteins") or "substantially pure preparations" or "purified preparations" are defined as 
encompassing preparations of hedgehog polypeptides having less than 20% (by dry weight) 

30 contaminating protein, and preferably having less than 5% contaminating protein. By "purified", 
it is meant that the indicated molecule is present in the substantial absence of other biological 
macromolecules, such as other proteins. The term "purified" as used herein preferably means at 
least 80% by dry weight, more preferably in the range of 95-99% by weight, and most preferably 
at least 99.8% by weight, of biological macromolecules of the same type present (but water, 



-23- 



ONV-080.01 



buffers, and other small molecules, especially molecules having a molecular weight of less than 
5000, can be present). The term "pure" as used herein preferably has the same numerical limits as 
"purified" immediately above. 

As described above for recombinant polypeptides, isolated hedgehog polypeptides can 
include all or a portion of the amino acid sequences represented in any of SEQ ID Nos:10-18 or 
20, or a homologous sequence thereto. Preferred fragments of the subject hedgehog proteins 
correspond to the N-terminal and C-terminal proteolytic fragments of the mature protein. 
Bioactive fragments of hedgehog polypeptides are described in great detail in PCT publications 
WO 95/1 8856 and WO 96/1 7924. 

With respect to bioctive fragments of hedgehog polypeptide, preferred hedgehog 
therapeutics include at least 50 (contiguous) amino acid residues of a hedgehog polypeptide, 
more preferably at least 100 (contiguous), and even more preferably at least 150 (contiguous) 
residues. 

Another preferred hedgehog polypeptide which can be included in the hedgehog 
therapeutic is an N-terminal fragment of the mature protein having a molecular weight of 
approximately 19 kDa. 

Preferred human hedgehog proteins include N-terminal fragments corresponding 
approximately to residues 24-197 of SEQ ID No. 15, 28-202 of SEQ ID No. 16, and 23-198 of 
SEQ ID No. 17. By "corresponding approximately" it is meant that the sequence of interest is at 
most 20 amino acid residues different in length to the reference sequence, though more 
preferably at most 5, 10 or 15 amino acid different in length. 

As described above for recombinant polypeptides, isolated hedgehog polypeptides can 
include all or a portion of the amino acid sequences represented in SEQ ID No:8, SEQ ID No:9, 
SEQ ID No:10, SEQ ID No:ll, SEQ ID No: 12, SEQ ID No:13 or SEQ ID No:14, or a 
homologous sequence thereto. Preferred fragments of the subject hedgehog proteins correspond 
to the N-terminal and C-terminal proteolytic fragments of the mature protein. Bioactive 
fragments of hedgehog polypeptides are described in great detail in PCT publications WO 
95/1 8856 and WO 96/1 7924. 

Still other preferred hedgehog polypeptides includes an amino acid sequence represented 
by the formula A-B wherein: (i) A represents all or the portion of the amino acid sequence 
designated by residues 1-168 of SEQ ID No:21; and B represents at least one amino acid residue 
of the amino acid sequence designated by residues 169-221 of SEQ ID No:21; (ii) A represents 
all or the portion of the amino acid sequence designated by residues 24-193 of SEQ ID No: 15; 
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and B represents at least one amino acid residue of the amino acid sequence designated by 
residues 194-250 of SEQ ED No:15; (iii) A represents all or the portion of the amino acid 
sequence designated by residues 25-193 of SEQ ID No:13; and B represents at least one amino 
acid residue of the amino acid sequence designated by residues 194-250 of SEQ ID No:13; (iv) A 
5 represents all or the portion of the amino acid sequence designated by residues 23-193 of SEQ ED 
No:ll; and B represents at least one amino acid residue of the amino acid sequence designated 
by residues 194-250 of SEQ ID No:ll; (v) A represents all or the portion of the amino acid 
sequence designated by residues 28-197 of SEQ ID No:12; and B represents at least one amino 
acid residue of the amino acid sequence designated by residues 198-250 of SEQ ID No: 12; (vi) A 
10 represents all or the portion of the amino acid sequence designated by residues 29-197 of SEQ ID 
No: 16; and B represents at least one amino acid residue of the amino acid sequence designated 
by residues 198-250 of SEQ ID No: 16; or (vii) A represents all or the portion of the amino acid 
sequence designated by residues 23-193 of SEQ ID No. 17, and B represents at least one amino 
1^ acid residue of the amino acid sequence designated by residues 194-250 of SEQ ID No. 17. In 
SJ 15 certain preferred embodiments, A and B together represent a contiguous polypeptide sequence 
* ™ designated sequence, A represents at least 25, 50, 75, 100, 125 or 150 (contiguous) amino acids 

SSfSS 

yh of the designated sequence, and B represents at least 5, 10, or 20 (contiguous) amino acid 
residues of the amino acid sequence designated by corresponding entry in the sequence listing, 

7" and A and B together preferably represent a contiguous sequence corresponding to the sequence 

20 listing entry. Similar fragments from other hedgehog also contemplated, e.g., fragments which 

j* g correspond to the preferred fragments from the sequence listing entries which are enumerated 

gj above. In preferred embodiments, the hedgehog polypeptide includes a C-terminal glycine (or 

;ff other appropriate residue) which is derivatized with a cholesterol. 

Isolated peptidyl portions of hedgehog proteins can be obtained by screening peptides 
25 recombinantly produced from the corresponding fragment of the nucleic acid encoding such 
peptides. In addition, fragments can be chemically synthesized using techniques known in the art 
such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, a hedgehog 
polypeptide of the present invention may be arbitrarily divided into fragments of desired length 
with no overlap of the fragments, or preferably divided into overlapping fragments of a desired 
30 length. The fragments can be produced (recombinantly or by chemical synthesis) and tested to 
identify those peptidyl fragments which can function as either agonists or antagonists of a wild- 
type (e.g., "authentic") hedgehog protein. For example, Roman et al. (1994) Eur J Biochem 
222:65-73 describe the use of competitive-binding assays using short, overlapping synthetic 
peptides from larger proteins to identify binding domains. 
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The recombinant hedgehog polypeptides of the present invention also include homologs 
of the authentic hedgehog proteins, such as versions of those protein which are resistant to 
proteolytic cleavage, as for example, due to mutations which alter potential cleavage sequences 
or which inactivate an enzymatic activity associated with the protein. Hedgehog homologs of the 
present invention also include proteins which have been post-translationally modified in a 
manner different than the authentic protein. Exemplary derivatives of hedgehog proteins include 
polypeptides which lack N-glycosylation sites (e.g. to produce an unglycosylated protein), which 
lack sites for cholesterolization, and/or which lack N-terminal and/or C-terminal sequences. 

Modification of the structure of the subject hedgehog polypeptides can also be for such 
purposes as enhancing therapeutic or prophylactic efficacy, or stability (e.g., ex vivo shelf life and 
resistance to proteolytic degradation in vivo). Such modified peptides, when designed to retain at 
least one activity of the naturally-occurring form of the protein, are considered functional 
equivalents of the hedgehog polypeptides described in more detail herein. Such modified 
peptides can be produced, for instance, by amino acid substitution, deletion, or addition. 

It is well known in the art that one could reasonably expect that certain isolated 
replacements of amino acids, e.g., replacement of an amino acid residue with another related 
amino acid (i.e. isosteric and/or isoelectric mutations), can be carried out without major effect on 
the biological activity of the resulting molecule. Conservative replacements are those that take 
place within a family of amino acids that are related in their side chains. Genetically encoded 
amino acids are can be divided into four families: (1) acidic = aspartate, glutamate; (2) basic = 
lysine, arginine, histidine; (3) nonpolar = alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan; and (4) uncharged polar = glycine, asparagine, glutamine, 
cysteine, serine, threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes 
classified jointly as aromatic amino acids. In similar fashion, the amino acid repertoire can be 
grouped as (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine histidine, (3) aliphatic = 
glycine, alanine, valine, leucine, isoleucine, serine, threonine, with serine and threonine 
optionally be grouped separately as aliphatic-hydroxyl; (4) aromatic = phenylalanine, tyrosine, 
tryptophan; (5) amide = asparagine, glutamine; and (6) sulfur -containing = cysteine and 
methionine, (see, for example, Biochemistry, 2nd ed., Ed. by L. Stryer, WH Freeman and Co.: 
1981). Whether a change in the amino acid sequence of a peptide results in a functional 
hedgehog homolog (e.g. functional in the sense that it acts to mimic or antagonize the wild-type 
form) can be readily determined by assessing the ability of the variant peptide to produce a 
response in cells in a fashion similar to the wild-type protein, or competitively inhibit such a 
response. Polypeptides in which more than one replacement has taken place can readily be tested 
in the same manner. 
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It is specifically contemplated that the methods of the present invention can be carried 
using homologs of naturally occurring hedgehog proteins. In one embodiment, the invention 
contemplates using hedgehog polypeptides generated by combinatorial mutagenesis. Such 
methods, as are known in the art, are convenient for generating both point and truncation 
5 mutants, and can be especially useful for identifying potential variant sequences (e.g. homologs) 
that are functional in binding to a receptor for hedgehog proteins. The purpose of screening such 
combinatorial libraries is to generate, for example, novel hedgehog homologs which can act as 
either agonists or antagonist. To illustrate, hedgehog homologs can be engineered by the present 
method to provide more efficient binding to a cognate receptor, such as patched, yet still retain at 
10 least a portion of an activity associated with hedgehog. Thus, combinatorially-derived homologs 
can be generated to have an increased potency relative to a naturally occurring form of the 
protein. Likewise, hedgehog homologs can be generated by the present combinatorial approach 
to act as antagonists, in that they are able to mimic, for example, binding to other extracellular 
^ matrix components (such as receptors), yet not induce any biological response, thereby inhibiting 
SI 15 the action of authentic hedgehog or hedgehog agonists. Moreover, manipulation of certain 
' domains of hedgehog by the present method can provide domains more suitable for use in fusion 

proteins, such as one that incorporates portions of other proteins which are derived from the 
0]j extracellular matrix and/or which bind extracellular matrix components. 

f To further illustrate the state of the art of combinatorial mutagenesis, it is noted that the 

r 20 review article of Gallop et al. (1994) J Med Chem 37:1233 describes the general state of the art 
of combinatorial libraries as of the earlier 1990's. In particular, Gallop et al state at page 1239 
ff [s]creening the analog libraries aids in determining the minimum size of the active sequence and 
in identifying those residues critical for binding and intolerant of substitution". In addition, the 
Ladner et al. PCT publication WO90/02809, the Goeddel et al. U.S. Patent 5,223,408, and the 
25 Markland et al. PCT publication W092/15679 illustrate specific techniques which one skilled in 
the art could utilize to generate libraries of hedgehog variants which can be rapidly screened to 
identify variants/fragments which retained a particular activity of the hedgehog polypeptides. 
These techniques are exemplary of the art and demonstrate that large libraries of related 
variants/truncants can be generated and assayed to isolate particular variants without undue 
30 experimentation. Gustin et al. (1993) Virology 193:653, and Bass et al. (1990) Proteins: 
Structure, Function and Genetics 8:309-314 also describe other exemplary techniques from the 
art which can be adapted as means for generating mutagenic variants of hedgehog polypeptides. 

Indeed, it is plain from the combinatorial mutagenesis art that large scale mutagenesis of 
hedgehog proteins, without any preconceived ideas of which residues were critical to the 
35 biological function, and generate wide arrays of variants having equivalent biological activity. 
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Indeed, it is the ability of combinatorial techniques to screen billions of different variants by high 
throughout analysis that removes any requirement of a priori understanding or knowledge of 
critical residues. 

To illsutrate, the amino acid sequences for a population of hedgehog homologs or other 
5 related proteins are aligned, preferably to promote the highest homology possible. Such a 
population of variants can include, for example, hedgehog homologs from one or more species. 
Amino acids which appear at each position of the aligned sequences are selected to create a 
degenerate set of combinatorial sequences. In a preferred embodiment, the variegated library of 
hedgehog variants is generated by combinatorial mutagenesis at the nucleic acid level, and is 
10 encoded by a variegated gene library. For instance, a mixture of synthetic oligonucleotides can be 
enzymatically ligated into gene sequences such that the degenerate set of potential hedgehog 
sequences are expressible as individual polypeptides, or alternatively, as a set of larger fusion 
□ proteins (e.g. for phage display) containing the set of hedgehog sequences therein. 

m - ' - 

y As illustrated in PCT publication WO 95/18856, to analyze the sequences of a population 

Py 15 of variants, the amino acid sequences of interest can be aligned relative to sequence homology. 

jl The presence or absence of amino acids from an aligned sequence of a particular variant is 

III relative to a chosen consensus length of a reference sequence, which can be real or artificial. 

* In an illustrative embodiment, alignment of exons 1, 2 and a portion of exon 3 encoded 

u sequences (e.g. the N-terminal approximately 221 residues of the mature protein) of each of the 
=^ 20 Shh clones produces a degenerate set of Shh polypeptides represented by the general formula: 



WW 



C-G-P-G-R-G-X(l)-G-X(2)-R-R-H-P-K-K-L-T-P-L-A-Y-K-Q-F-I-P-N-V-A-E- 
K-T-L-G-A-S-G-R-Y-E-G-K-I-X(3)-R-N-S-E-R-F-K-E-L-T-P-N-Y-N-P-D-I-I-F- 
K-D-E-E-N-T-G-A-D-R-L-M-T-Q-R-C-K-D-K-L-N-X(4)-L-A-I-S-V-M-N-X(5)- 
W-P-G-V-X(6)-L-R-V-T-E-G-W-D-E-D-G-H-H-X(7)-E-E-S-L-H-Y-E-G-R-A-V- 
25 D-I-T-T-S-D-R-D-X(8)-S-K-Y-G-X(9)-L-X(10)-R-L-A-V-E-A-G-F-D-W-V-Y- 
Y-E-S-K-A-H-I-H-C-S-V-K-A-E-N-S-V-A-A-K-S-G-G-C-F-P-G-S-A-X(ll)-V- 
X(12)-L-X(13)-X(14)-G-G-X(15)-K-X-(16)-V-K-D-L-X(17)-P-G-D-X(18)-V-L- 
A-A-D-X(19)-X(20)-G-X(21)-L-X(22)-X(23)-S-D-F-X(24)-X(25)-F-X(26)-D-R 
(SEQIDNo:21 

30 wherein each of the degenerate positions "X" can be an amino acid which occurs in that position 
in one of the human, mouse, chicken or zebrafish Shh clones, or, to expand the library, each X 
can also be selected from amongst amino acid residue which would be conservative substitutions 
for the amino acids which appear naturally in each of those positions. For instance, Xaa(l) 
represents Gly, Ala, Val, Leu, He, Phe, Tyr or Trp ; Xaa(2) represents Arg, His or Lys; Xaa(3) 

35 represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(4) represents Gly, Ala, Val, Leu, He, Ser or 
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Thr; Xaa(5) represents Lys, Arg, His, Asn or Gin; Xaa(6) represents Lys, Arg or His; Xaa(7) 
represents Ser, Thr, Tyr, Trp or Phe; Xaa(8) represents Lys, Arg or His; Xaa(9) represents Met, 
Cys, Ser or Thr; Xaa(lO) represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(ll) represents Leu, 
Val, Met, Thr or Ser; Xaa(12) represents His, Phe, Tyr, Ser, Thr, Met or Cys; Xaa(13) represents 
5 Gin, Asn, Glu, or Asp; Xaa(14) represents His, Phe, Tyr, Thr, Gin, Asn, Glu or Asp; Xaa(15) 
represents Gin, Asn, Glu, Asp, Thr, Ser, Met or Cys; Xaa(16) represents Ala, Gly, Cys, Leu, Val 
or Met; Xaa(17) represents Arg, Lys, Met, He, Asn, Asp, Glu, Gin, Ser, Thr or Cys; Xaa(18) 
represents Arg, Lys, Met or He; Xaa(19) represents Ala, Gly, Cys, Asp, Glu, Gin, Asn, Ser, Thr 
or Met; Xaa(20) represents Ala, Gly, Cys, Asp, Asn, Glu or Gin; Xaa(21) represents Arg, Lys, 
10 Met, He, Asn, Asp, Glu or Gin; Xaa(22) represent Leu, Val, Met or He; Xaa(23) represents Phe, 
Tyr, Thr, His or Tip; Xaa(24) represents He, Val, Leu or Met; .Xaa(25) represents Met, Cys, He, 
Leu, Val, Thr or Ser; Xaa(26) represents Leu, Val, Met, Thr or Ser. In an even more expansive 
library, each X can be selected from any amino acid. 

e . In similar fashion, alignment of each of the human, mouse, chicken and zebrafish 

~! 15 hedgehog clones, can provide a degenerate polypeptide sequence represented by the general 
jr formula: 

C-G-P-G-R-G-X(l)-X(2)-X(3)-R-M 
Y-K-Q-F-X(10)-P-X(l 1)-X^ 
X(17)-X(18)-X(19)-R-X(20^ 

D-E-E-N.X(23)-G-A-D-R-L-M-T-X(24)-R-C-K-X(25)-X(26)-X(27)-N-X(28)-L- 
A-I-S-V-M-N-X(29)-W-P-G-V-X(30)-L-R-V-T-E-G-X(31)-D-E-D-G-H-H- 
X(32)-X(33)-X(34)-S-L-H-Y-E-G-R-A-X(35)-D-I-T-T-S-D-R-D-X(36)-X(37)-K- 
Y-G-X(38)-L.X(39)-R-L-A-V-E-A-G-F-D-W.V-Y-Y-E-S-X(40)-X(41)-H-X(42). 
H-X(43) : S-V-K-X(44)-X(45) (SEQIDNo:22 

25 wherein, as above, each of the degenerate positions "X" can be an amino acid which occurs in a 
corresponding position in one of the wild-type clones, and may also include amino acid residue 
which would be conservative substitutions, or each X can be any amino acid residue. In an 
exemplary embodiment, Xaa(l) represents Gly, Ala, Val, Leu, He, Pro, Phe or Tyr; Xaa(2) 
represents Gly, Ala, Val, Leu or He; Xaa(3) represents Gly, Ala, Val, Leu, He, Lys, His or Arg; 
30 Xaa(4) represents Lys, Arg or His; Xaa(5) represents Phe, Trp, Tyr or an amino acid gap; Xaa(6) 
represents Gly, Ala, Val, Leu, He or an amino acid gap; Xaa(7) represents Asn, Gin, His, Arg or 
Lys; Xaa(8) represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(9) represents Gly, Ala, Val, Leu, 
He, Ser or Thr; Xaa(10) represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(ll) represents Ser, 
Thr, Gin or Asn; Xaa(12) represents Met, Cys, Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(13) 
35 represents Gly, Ala, Val, Leu, He or Pro; Xaa(14) represents Arg, His or Lys; Xaa(15) represents 
Gly, Ala, Val, Leu, He, Pro, Arg, His or Lys; Xaa(16) represents Gly, Ala, Val, Leu, He, Phe or 
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Tyr; Xaa(17) represents Arg, His or Lys; Xaa(18) represents Gly, Ala, Val, Leu, He, Ser or Thr; 
Xaa(19) represents Thr or Ser; Xaa(20) represents Gly, Ala, Val, Leu, He, Asn or Gin; Xaa(21) 
represents Arg, His or Lys; Xaa(22) represents Asp or Glu; Xaa(23) represents Ser or Thr; 
Xaa(24) represents Glu, Asp, Gin or Asn; Xaa(25) represents Glu or Asp; Xaa(26) represents 
5 Arg, His or Lys; Xaa(27) represents Gly, Ala, Val, Leu or He; Xaa(28) represents Gly, Ala, Val, 
Leu, He, Thr or Ser; Xaa(29) represents Met, Cys, Gin, Asn, Arg, Lys or His; Xaa(30) represents 
Arg, His or Lys; Xaa(31) represents Trp, Phe, Tyr, Arg, His or Lys; Xaa(32) represents Gly, Ala, 
Val, Leu, lie, Ser, Thr, Tyr or Phe; Xaa(33) represents Gin, Asn, Asp or Glu; Xaa(34) represents 
Asp or Glu; Xaa(35) represents Gly, Ala, Val, Leu, or He; Xaa(36) represents Arg, His or Lys; 
10 Xaa(37) represents Asn, Gin, Thr or Ser; Xaa(38) represents Gly, Ala, Val, Leu, He, Ser, Thr, 
Met or Cys; Xaa(39) represents Gly, Ala, Val, Leu, He, Thr or Ser; Xaa(40) represents Arg, His 
or Lys; Xaa(41) represents Asn, Gin, Gly, Ala, Val, Leu or He; Xaa(42) represents Gly, Ala, Val, 
Leu or He; Xaa(43) represents Gly, Ala, Val, Leu, He, Ser, Thr or Cys; Xaa(44) represents Gly, 
J Ala, Val, Leu, He, Thr or Ser; and Xaa(45) represents Asp or Glu. 

15 There are many ways by which the library of potential hedgehog homologs can be 

4S generated from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene 

J3 

^ sequence can be earned out in an automatic DNA synthesizer, and the synthetic genes then 
rp ligated into an appropriate expression vector. The purpose of a degenerate set of genes is to 
r s provide, in one mixture, all of the sequences encoding the desired set of potential hedgehog 
u 20 sequences. The synthesis of degenerate oligonucleotides is well known in the art (see for 
JV example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA t Proc 
™ 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273-289; 
D Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Bee et 
al. (1983) Nucleic Acid Res. 11:477. Such techniques have been employed in the directed 
25 evolution of other proteins (see, for example, Scott et al. (1990) Science 249:386-390; Roberts et 
al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) 
PNAS 87: 6378-6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 

A wide range of techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations, and for screening cDNA libraries for gene 

30 products having a certain property. Such techniques will be generally adaptable for rapid 
screening of the gene libraries generated by the combinatorial mutagenesis of hedgehog 
homologs. The most widely used techniques for screening large gene libraries typically 
comprises cloning the gene library into replicable expression vectors, transforming appropriate 
cells with the resulting library of vectors, and expressing the combinatorial genes under 

35 conditions in which detection of a desired activity facilitates relatively easy isolation of the 
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vector encoding the gene whose product was detected. Each of the illustrative assays described 
below are amenable to high through-put analysis as necessary to screen large numbers of 
degenerate hedgehog sequences created by combinatorial mutagenesis techniques. 

In one embodiment, the combinatorial library is designed to be secreted (e.g. the 
5 polypeptides of the library all include a signal sequence but no transmembrane or cytoplasmic 
domains), and is used to transfect a eukaryotic cell that can be co-cultured with peripehral nerve 
cells. A functional hedgehog protein secreted by the cells expressing the combinatorial library 
will diffuse to neighboring T lymphocytes and induce a particular biological response, such as 
proliferation or differentiation. The pattern of detection of such a change in phenotype will 
10 resemble a gradient function, and will allow the isolation (generally after several repetitive 
rounds of selection) of cells producing hedgehog homologs active as neurotrophic agents. 
Likewise, hedgehog antagonists can be selected in similar fashion by the ability of the cell 
f -*% producing a functional antagonist to protect neighboring cells (e.g., to inhibit proliferation) from 
v3 the effect of wild-type hedgehog added to the culture media. 

r=J 15 To illustrate, target T lymphocytes are cultured in 24-well microtitre plates. Other 

+: eukaryotic cells are transfected with the combinatorial hedgehog gene library and cultured in cell 
01 culture inserts (e.g. Collaborative Biomedical Products, Catalog #40446) that are able to fit into 
=F the wells of the microtitre plate. The cell culture inserts are placed in the wells such that 
y, recombinant hedgehog homologs secreted by the cells in the insert can diffuse through the porous 
K fc 20 bottom of the insert and contact the target cells in the microtitre plate wells. After a period of 
ji" time sufficient for functional forms of a hedgehog protein to produce a measurable response in 
Q the target cells, such as growth state, the inserts are removed and the effect of the variant 
^ hedgehog proteins on the target cells determined. Cells from the inserts corresponding to wells 
which score positive for activity can be split and re-cultured on several inserts, the process being 
25 repeated until the active clones are identified. 

In yet another screening assay, the candidate hedgehog gene products are displayed on the 
surface of a cell or viral particle, and the ability of particular cells or viral particles to associate 
with a hedgehog-binding moiety (such as the patched protein or other hedgehog receptor) via this 
gene product is detected in a "panning assay". Such panning steps can be carried out on cells 
30 cultured from embryos. For instance, the gene library can be cloned into the gene for a surface 
membrane protein of a bacterial cell, and the resulting fusion protein detected by panning 
(Ladner et al., WO 88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et 
al. (1992) TIBS 18:136-140). In a similar fashion, fluorescently labeled molecules which bind 
hedgehog can be used to score for potentially functional hedgehog homologs. Cells can be 
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visually inspected and separated under a fluorescence microscope, or, where the morphology of 
the cell permits, separated by a fluorescence-activated cell sorter. 

In an alternate embodiment, the gene library is expressed as a fusion protein on the 
surface of a viral particle. For instance, in the filamentous phage system, foreign peptide 
5 sequences can be expressed on the surface of infectious phage, thereby conferring two significant 
benefits. First, since these phage can be applied to affinity matrices at very high concentrations, 
large number of phage can be screened at one time. Second, since each infectious phage displays 
the combinatorial gene product on its surface, if a particular phage is recovered from an affinity 
matrix in low yield, the phage can be amplified by another round of infection. The group of 
10 almost identical Exoli filamentous phages Ml 3, fd, and fl are most often used in phage display 
libraries, as either of the phage gin or gVIII coat proteins can be used to generate fusion proteins 
without disrupting the ultimate packaging of the viral particle (Ladner et al. PCT publication WO 
^ 90/02909; Garrard et al., PCT publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 
3 267:16007-16010; Griffths et al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 

y 15 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461). 

\u 

1: In an illustrative embodiment, the recombinant phage antibody system (RPAS, 

m Pharamacia Catalog number 27-9400-01) can be easily modified for use in expressing and 
screening hedgehog combinatorial libraries. For instance, the pCANTAB 5 phagemid of the 
j^. RPAS kit contains the gene which encodes the phage gin coat protein. The hedgehog 
20 combinatorial gene library can be cloned into the phagemid adjacent to the gin signal sequence 

n\ 

such that it will be expressed as a gin fusion protein. After ligation, the phagemid is used to 
p transform competent E. coli TGI cells. Transformed cells are subsequently infected with 
M M13K07 helper phage to rescue the phagemid and its candidate hedgehog gene insert. The 
resulting recombinant phage contain phagemid DNA encoding a specific candidate hedgehog, 
25 and display one or more copies of the corresponding fusion coat protein. The phage-displayed 
candidate hedgehog proteins which are capable of binding an hedgehog receptor are selected or 
enriched by panning. For instance, the phage library can be applied to cells which express the 
patched protein and unbound phage washed away from the cells. The bound phage is then 
isolated, and if the recombinant phage express at least one copy of the wild type gin coat protein, 
30 they will retain their ability to infect E. coli. Thus, successive rounds of reinfection of E. coli, 
and panning will greatly enrich for hedgehog homologs, which can then be screened for further 
biological activities in order to differentiate agonists and antagonists. 

Combinatorial mutagenesis has a potential to generate very large libraries of mutant 
proteins, e.g., in the order of 10 26 molecules. Combinatorial libraries of this size may be 
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technically challenging to screen even with high throughput screening assays such as phage 
display. To overcome this problem, a new technique has been developed recently, recursive 
ensemble mutagenesis (REM), which allows one to avoid the very high proportion of non- 
functional proteins in a random library and simply enhances the frequency of functional proteins, 
5 thus decreasing the complexity required to achieve a useful sampling of sequence space. REM is 
an algorithm which enhances the frequency of functional mutants in a library when an appropriate 
selection or screening method is employed (Arkin and Yourvan, 1992, PNAS USA 89:781 1-7815; 
Yourvan et al., 1992, Parallel Problem Solving from Nature, 2., In Maenner and Manderick, eds., 
Elsevir Publishing Co., Amsterdam, pp. 401-410; Delgrave et al., 1993, Protein Engineering 
10 6(3):327-331). 

The invention also provides for reduction of the hedgehog protein to generate mimetics, 
e.g. peptide or non-peptide agents, which are able to disrupt binding of a hedgehog polypeptide 
of the present invention with an hedgehog receptor. Thus, such mutagenic techniques as 
y3. described above are also useful to map the determinants of the hedgehog proteins which 

J: 15 participate in protein-protein interactions involved in, for example, binding of the subject 

i y 

j= hedgehog polypeptide to other extracellular matrix components. To illustrate, the critical 
«f residues of a subject hedgehog polypeptide which are involved in molecular recognition of an 
jg hedgehog receptor such as patched can be determined and used to generate hedgehog-derived 
f peptidomimetics which competitively inhibit binding of the authentic hedgehog protein with that 

20 moiety. By employing, for example, scanning mutagenesis to map the amino acid residues of 
each of the subject hedgehog proteins which are involved in binding other extracellular proteins, 
peptidomimetic compounds can be generated which mimic those residues of the hedgehog 
protein which facilitate the interaction. Such mimetics may then be used to interfere with the 
normal function of a hedgehog protein. For instance, non-hydrolyzable peptide analogs of such 
25 residues can be generated using benzodiazepine (e.g., see Freidinger et al. in Peptides: Chemistry 
and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., 
see Huffman et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: 
Leiden, Netherlands, 1988), substituted gama lactam rings (Garvey et al. in Peptides: Chemistry 
and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene 
30 pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: 
Structure and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical 
Co. Rockland, IL, 1985), P-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; 
and Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and P-aminoalcohols (Gordon et al. 
(1985) Biochem Biophys Res Commun\26'A\9\ and Dann et al. (1986) Biochem Biophys Res 
35 Commun 134:71). 



-33 - 



ONV-080.01 




Recombinantly produced forms of the hedgehog proteins can be produced using, e.g, 
expression vectors containing a nucleic acid encoding a hedgehog polypeptide, operably linked 
to at least one transcriptional regulatory sequence. Operably linked is intended to mean that the 
nucleotide sequence is linked to a regulatory sequence in a manner which allows expression of 
5 the nucleotide sequence. Regulatory sequences are art-recognized and are selected to direct 
expression of a hedgehog polypeptide. Accordingly, the term transcriptional regulatory sequence 
includes promoters, enhancers and other expression control elements. Such regulatory sequences 
are described in Goeddel; Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, CA (1990). For instance, any of a wide variety of expression control 
10 sequences, sequences that control the expression of a DNA sequence when operatively linked to 
it, may be used in these vectors to express DNA sequences encoding hedgehog polypeptide. 
Such useful expression control sequences, include, for example, a viral LTR, such as the LTR of 
the Moloney murine leukemia virus, the early and late promoters of SV40, adenovirus or 

■ ■ 

%. cytomegalovirus immediate early promoter, the lac system, the tip system, the TAC or TRC 
%j 15 system, T7 promoter whose expression is directed by T7 RNA polymerase, the major operator 
*{f and promoter regions of phage X , the control regions for fd coat protein, the promoter for 3- 
; h phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., 
EH Pho5, the promoters of the yeast a-mating factors, the polyhedron promoter of the baculovirus 
g ' system and other sequences known to control the expression of genes of prokaryotic or 

M 20 eukaryotic cells or their viruses, and various combinations thereof. It should be understood that 
£i the design of the expression vector may depend on such factors as the choice of the host cell to 
S3 be transformed and/or the type of protein desired to be expressed. Moreover, the vector's copy 

number, the ability to control that copy number and the expression of any other proteins encoded 

by the vector, such as antibiotic markers, should also be considered. 

25 In addition to providing a ready source of hedgehog polypeptides for purification, the 

gene constructs of the present invention can also be used as a part of a gene therapy protocol to 
deliver nucleic acids encoding either an agonistic or antagonistic form of a hedgehog 
polypeptide. Thus, another aspect of the invention features expression vectors for in vivo 
transfection of a hedgehog polypeptide in particular cell types so as cause ectopic expression of a 

30 hedgehog polypeptide in an T lymphocytes or other cells associated therewith. 

Formulations of such expression constructs may be administered in any biologically 
effective carrier, e.g. any formulation or composition capable of effectively delivering the 
recombinant gene to cells in vivo. Approaches include insertion of the hedgehog coding sequence 
in viral vectors including recombinant retroviruses, adenovirus, adeno-associated virus, and 
35 herpes simplex virus-1, or recombinant bacterial or eukaryotic plasmids. Viral vectors transfect 
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cells directly; plasmid DNA can be delivered with the help of, for example, cationic liposomes 
(lipofectin) or derivatized (e.g. antibody conjugated), polylysine conjugates, gramacidin S, 
artificial viral envelopes or other such intracellular carriers, as well as direct injection of the gene 
construct or CaP0 4 precipitation carried out in vivo. It will be appreciated that because 

5 transduction of appropriate target cells represents the critical first step in gene therapy, choice of 
the particular gene delivery system will depend on such factors as the phenotype of the intended 
target and the route of administration, e.g. locally or systemically. Furthermore, it will be 
recognized that the particular gene construct provided for in vivo transduction of hedgehog 
expression are also useful for in vitro transduction of cells, such as for use in the ex vivo tissue 
1 0 culture systems described below. 

A preferred approach for in vivo introduction of nucleic acid into a cell is by use of a viral 
vector containing nucleic acid, e.g. a cDNA, encoding the particular form of the hedgehog 
polypeptide desired. Infection of cells with a viral vector has the advantage that a large 
u! proportion of the targeted cells can receive the nucleic acid. Additionally, molecules encoded 
' «/; 15 within the viral vector, e.g., by a cDNA contained in the viral vector, are expressed efficiently in 
£ cells which have taken up viral vector nucleic acid. 

jjj Retrovirus vectors and adeno-associated virus vectors are generally understood to be the 

4 s recombinant gene delivery system of choice for the transfer of exogenous genes in vivo, 
particularly into humans. These vectors provide efficient delivery of genes into cells, and the 
H* 20 transferred nucleic acids are stably integrated into the chromosomal DNA of the host. A major 
Li prerequisite for the use of retroviruses is to ensure the safety of their use, particularly with regard 
B to the possibility of the spread of wild-type virus in the cell population. The development of 
M specialized cell lines (termed "packaging cells") which produce only replication-defective 
retroviruses has increased the utility of retroviruses for gene therapy, and defective retroviruses 
25 are well characterized for use in gene transfer for gene therapy purposes (for a review see Miller, 
A.D. (1990) Blood 76:271). Thus, recombinant retrovirus can be constructed in which part of 
the retroviral coding sequence (gag, pol, env) has been replaced by nucleic acid encoding a 
hedgehog polypeptide and renders the retrovirus replication defective. The replication defective 
retrovirus is then packaged into virions which can be used to infect a target cell through the use 
30 of a helper virus by standard techniques. Protocols for producing recombinant retroviruses and 
for infecting cells in vitro or in vivo with such viruses can be found in Current Protocols in 
Molecular Biology, Ausubel, F.M. et al. (eds.) Greene Publishing Associates, (1989), Sections 
9.10-9.14 and other standard laboratory manuals. Examples of suitable retroviruses include pLJ, 
pZIP, pWE and pEM which are well known to those skilled in the art. Examples of suitable 
35 packaging virus lines for preparing both ecotropic and amphotropic retroviral systems include 
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Crip, Cre, 2 and Am. Retroviruses have been used to introduce a variety of genes into many 
different cell types, including T lymphocytes, in vitro and/or in vivo (see for example Eglitis, et 
al. (1985) Science 230:1395-1398; Danos and Mulligan (1988) Proc. Natl Acad, Set USA 
85:6460-6464; Wilson et al. (1988) Proc, Natl Acad. Set USA 85:3014-3018; Armentano et al. 

(1990) Proc. Natl Acad Sci. USA 87:6141-6145; Huber et al. (1991) Proc. Natl Acad Set USA 
88:8039-8043; Ferry et al. (1991) Proc. Natl Acad Set USA 88:8377-8381; Chowdhury et al. 

(1991) Science 254:1802-1805; van Beusechem et al. (1992) Proa Natl Acad. Sci. USA 
89:7640-7644; Kay et al. (1992) Human Gene Therapy 3:641-647; Dai et al. (1992) Proc. Natl 
Acad Sci. USA 89:10892-10895; Hwu et al. (1993)7. Immunol 150:4104-4115; U.S. Patent No. 
4,868,116; U.S. Patent No. 4,980,286; PCT Application WO 89/07136; PCT Application WO 
89/02468; PCT Application WO 89/05345; and PCT Application WO 92/07573). 

Furthermore, it has been shown that it is possible to limit the infection spectrum of 
retroviruses and consequently of retroviral-based vectors, by modifying the viral packaging 
proteins on the surface of the viral particle (see, for example PCT publications W093/25234 and 
WO94/06920). For instance, strategies for the modification of the infection spectrum of 
retroviral vectors include: coupling antibodies specific for cell surface antigens to the viral env 
protein (Roux et al. (1989) PNAS 86:9079-9083; Man et al. (1992) J. Gen Virol 73:3251-3255; 
and Goud et al. (1983) Virology 163:251-254); or coupling cell surface receptor ligands to the 
viral env proteins (Neda et al. (1991) J Biol Chem 266:14143-14146). Coupling can be in the 
form of the chemical cross-linking with a protein or other variety (e.g. lactose to convert the env 
protein to an asialoglycoprotein), as well as by generating fusion proteins (e.g. single-chain 
antibody/en v fusion proteins). This technique, while useful to limit or otherwise direct the 
infection to certain tissue types, can also be used to convert an ecotropic vector in to an 
amphotropic vector. 

Moreover, use of retroviral gene delivery can be further enhanced by the use of tissue- or 
cell-specific transcriptional regulatory sequences which control expression of the hedgehog gene 
of the retroviral vector. 

Another viral gene delivery system useful in the present method utilizes adenovirus- 
derived vectors. The genome of an adenovirus can be manipulated such that it encodes and 
expresses a gene product of interest but is inactivated in terms of its ability to replicate in a 
normal lytic viral life cycle. See for example Berkner et al. (1988) BioTechniques 6:616; 
Rosenfeld et al. (1991) Science 252:431-434; and Rosenfeld et al. (1992) Cell 68:143-155. 
Suitable adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or other strains of 
adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known to those skilled in the art. Recombinant 
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adenoviruses can be advantageous in certain circumstances in that they can be used to infect a 
wide variety of cell types, including T lymphocytes. Furthermore, the virus particle is relatively 
stable and amenable to purification and concentration, and as above, can be modified so as to 
affect the spectrum of infectivity. Additionally, introduced adenoviral DNA (and foreign DNA 
contained therein) is not integrated into the genome of a host cell but remains episomal, thereby 
avoiding potential problems that can occur as a result of insertional mutagenesis in situations 
where introduced DNA becomes integrated into the host genome (e.g., retroviral DNA). 
Moreover, the carrying capacity of the adenoviral genome for foreign DNA is large (up to 8 
kilobases) relative to other gene delivery vectors (Berkner et al. cited supra; Haj-Ahmand and 
Graham (1986) J, Virol. 57:267). Most replication-defective adenoviral vectors currently in use 
and therefore favored by the present invention are deleted for all or parts of the viral El and E3 
genes but retain as much as 80% of the adenoviral genetic material (see, e.g., Jones et al. (1979) 
Cell 16:683; Berkner et al., supra; and Graham et al. in Methods in Molecular Biology. E.J. 
Murray, Ed. (Humana, Clifton, NJ, 1991) vol. 7. pp. 109-127). Expression of the inserted 
hedgehog gene can be under control of, for example, the El A promoter, the major late promoter 
(MLP) and associated leader sequences, the E3 promoter, or exogenously added promoter 
sequences. 

In addition to viral transfer methods, such as those illustrated above, non-viral methods 
can also be employed to cause expression of 2l hedgehog polypeptide in the tissue of an. animal. 
Most nonviral methods of gene transfer rely on normal mechanisms used by mammalian cells for 
the uptake and intracellular transport of macromolecules. In preferred embodiments, non- viral 
gene delivery systems of the present invention rely on endocytic pathways for the uptake of the 
hedgehog polypeptide gene by the targeted cell. Exemplary gene delivery systems of this type 
include liposomal derived systems, poly-lysine conjugates, and artificial viral envelopes. 

In clinical settings, the gene delivery systems for the therapeutic hedgehog gene can be 
introduced into a patient by any of a number of methods, each of which is familiar in the art. For 
instance, a pharmaceutical preparation of the gene delivery system can be introduced 
systemically, e.g. by intravenous injection, and specific transduction of the protein in the target 
cells occurs predominantly from specificity of transfection provided by the gene delivery vehicle, 
cell-type or tissue-type expression due to the transcriptional regulatory sequences controlling 
expression of the receptor gene, or a combination thereof. In other embodiments, initial delivery 
of the recombinant gene is more limited with introduction into the animal being quite localized. 
For example, the gene delivery vehicle can be introduced by catheter (see U.S. Patent 5,328,470) 
or by stereotactic injection (e.g. Chen et al. (1994) PNAS 91: 3054-3057). A hedgehog 
expression construct can be delivered in a gene therapy construct to dermal cells by, e.g., 
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electroporation using techniques described, for example, by Dev et al. ((1994) Cancer Treat Rev 
20:105-115). 

The pharmaceutical preparation of the gene therapy construct can consist essentially of 
the gene delivery system in an acceptable diluent, or can comprise a slow release matrix in which 
5 the gene delivery vehicle is imbedded. Alternatively, where the complete gene delivery system 
can be produced intact from recombinant cells, e.g. retroviral vectors, the pharmaceutical 
preparation can comprise one or more cells which produce the gene delivery system. 

In yet another embodiment, the hedgehog or ptc therapeutic can be a "gene activation" 
construct which, by homologous recombination with a genomic DNA, alters the transcriptional 

10 regulatory sequences of an endogenous gene. For instance, the gene activation construct can 
replace the endogenous promoter of a hedgehog gene with a heterologous promoter, e.g., one 
which causes consitutive expression of the hedgehog gene or which causes inducible expression 
of the gene under conditions different from the normal expression pattern of the gene. Other 
genes in the patched signaling pathway can be similarly targeted. A vareity of different formats 

15 for the gene activation constructs are available. See, for example, the Transkaryotic Therapies, 
Inc PCT publications WO93/09222, WO95/31560, W096/29411, WO95/31560 and 
WO94/12650. 

In preferred embodiments, the nucleotide sequence used as the gene activation construct 
can be comprised of (1) DNA from some portion of the endogenous hedgehog gene (exon 
20 sequence, intron sequence, promoter sequences, etc.) which direct recombination and (2) 
heterologous transcriptional regulatory sequence(s) which is to be operably linked to the coding 
sequence for the genomic hedgehog gene upon recombination of the gene activation construct. 
For use in generating cultures of hedgehog producing cells, the construct may further include a 
reporter gene to detect the presence of the knockout construct in the cell. 

25 The gene activation construct is inserted into a cell, and integrates with the genomic DNA 

of the cell in such a position so as to provide the heterologous regulatory sequences in operative 
association with the native hedgehog gene. Such insertion occurs by homologous recombination, 
i.e., recombination regions of the activation construct that are homologous to the endogenous 
hedgehog gene sequence hybridize to the genomic DNA and recombine with the genomic 

30 sequences so that the construct is incorporated into the corresponding position of the genomic 
DNA. 

The terms "recombination region" or "targeting sequence" refer to a segment (i.e., a 
portion) of a gene activation construct having a sequence that is substantially identical to or 
substantially complementary to a genomic gene sequence, e.g., including 5' flanking sequences 
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of the genomic gene, and can facilitate homologous recombination between the genomic 
sequence and the targeting transgene construct. 

As used herein, the term "replacement region" refers to a portion of a activation construct 
which becomes integrated into an endogenous chromosomal location following homologous 
5 recombination between a recombination region and a genomic sequence. 

The heterologous regulatory sequences, e.g., which are provided in the replacement 
region, can include one or more of a variety elements, including: promoters (such as constitutive 
or inducible promoters), enhancers, negative regualtory elements, locus control regions, 
transcription factor binding sites, or combinations thereof. Promoters/enhancers which may be 
10 used to control the expression of the targeted gene in vivo include, but are not limited to, the 
cytomegalovirus (CMV) promoter/enhancer (Karasuyama et ah, 1989, J. Exp. Med., 169:13), the 
human p-actin promoter (Gunning et ah (1987) PNAS 84:4831-4835), the glucocorticoid- 

I inducible promoter present in the mouse mammary tumor virus long terminal repeat (MMTV 
LTR) (Klessig et ah (1984) Mol Cell Biol 4:1354-1362), the long terminal repeat sequences of 

1 15 Moloney murine leukemia virus (MuLV LTR) (Weiss et ah (1985) RNA Tumor Viruses, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York), the SV40 early or late region 
promoter (Bernoist et ah (1981) Nature 290:304-310; Templeton et ah (1984) Mol Cell Biol, 
4:817; and Sprague et ah (1983) J. Virol, 45:773), the promoter contained in the 3' long terminal 
repeat of Rous sarcoma virus (RSV) (Yamamoto et ah, 1980, Cell, 22:787-797), the herpes 

20 simplex virus (HSV) thymidine kinase promoter/enhancer (Wagner et ah (1981) PNAS 82:3567- 
71), and the herpes simplex virus LAT promoter (Wolfe et ah (1992) Nature Genetics, 1:379- 
384). 

In an exemplar)! embodiment, portions of the 5' flanking region of the human Shh gene 
are amplified using prin ers which add restriction sites, to generate the following fragments 

5 ' -gcgcgcttcgaaGCGAGGCAGCCAGCGAGGGAGAGAGCGAGCGGGCGAGCCGGAGC- 
GAGGAAatcgatgcgcic (primer 1) ' 

5 ' -gcgcgcagatctGCpGAAAGCGCAAGAGAGAGCGCACACGCACACACCCGCCGCGCG- 
CACTCGggatccgcgcc 




30 



c (primer 2) 



As illustrated, primer 1 includes a 5' non-coding region of the human Shh gene and is flanked by 
an AsuII and Clal restriction sites. Primer 2 includes a portion of the 5' non-coding region 
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immediately 3' to that present in primer 1. The hedgehog gene sequence is flanked by XhoII and 
BamHI restriction sites. The purified amplimers are cut with each of the enzymes as appropriate. 

The vector pCDNAl.l (Invitrogen) includes a CMV promoter. The plasmid is cut with 
with AsuII, which cleaves just 3' to the CMV promoter sequence. The AsuH/Clal fragment of 
primer 1 is ligated to the AsuII cleavage site of the pcDNA vector. The Clal/ AsuII ligation 
destroys the AsuII site at the 3' end of a properly inserted primer 1. 

The vector is then cut with BamHI, and an XhoE/BamHI fragment of primer 2 is ligated 
to the BamHI cleavage site. As above, the BamHI/XhoII ligation destroys the BamHI site at the 
5' end of a properly inserted primer 2. 

. >^^> Individual colonics are selected, cut with AsuII and BamHI, and the size of the 
^suD/BamHI fragment determined. Colonies in which both the primer 1 and primer 2 sequences 
are correctly inserted are mrther amplified, an cut with AsuII and BamHI to produce the gene 
activation construct 

cgaagcgaggcagccagcgfegggagagagcgagcgggcgagccggagcgaggaaATCGAAGGTTC 
GAATCCTTCCCCCACCACCATCACTTTCAAAAGTCCGAAAGAATCTGCTCCCTGCTTGTGTGTTG 
GAGGTCGCTGAGTAGTGCGC3GAGTAAAATTTAAGCTACAACAAGGCAAGGCTTGACCGACAATTG 
CATGAAGAATCTGCTTAGGOTTAGGCGTTTTGCGCTGCTTCGCGATGTACGGGCCAGATATACGC 
GTTGACATTGATTATTGACTkGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCA 
TATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCC 
CCGCCCATTGACGTCAATAA'BGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGAC 
GTCAATGGGTGGACTATTTAOGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCA 
AGTACGCCCCCTATTGACGTckATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGAC 
CTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGC 
GGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCAC 
CCCATTGACGTCAATGGGAGT*] TGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAA 
CAACTCCGCCCCATTGACGCA^ATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAG 
CTCTCTGGCTAACTAGAGAACC CACTGCTTACTGGCTTATCGAAATTAATACGACTCACTATAGG 
GAGACCCAAGCTTGGTACCGAGCTCGGATCgatctgggaaagcgcaagagagagcgcacacgcac 
acacccgccgcgcgcactcgg 

In this construct, the flanking primer 1 and primer 2 sequences provide the recombination region 
which permits the insertion of the CMV promoter in front of the coding sequence for the human 
Shh gene. Other heterologous promoters (or other transcriptional regulatory sequences) can be 
inserted in a genomic hedgehog gene by a similar method. 
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In still other embodiments, the replacement region merely deletes a negative 
transcriptional control element of the native gene, e.g., to activate expression, or ablates a 
positive control element, e.g., to inhibit expression of the targeted gene. 

5 V. Exemplary ptc therapeutic compounds. 

In another embodiment, the subject method is carried out using a ptc therapeutic 
composition. Such compositions can be generated with, for example, compounds which bind to 
patched and alter its signal transduction activity, compounds which alter the binding and/or 
enzymatic activity of a protein (e.g., intracellular) involved in patched signal pathway, and 
10 compounds which alter the level of expression of a hedgehog protein, a patched protein or a 
protein involved in the intracellular signal transduction pathway of patched. 

□ The availability of purified and recombinant hedgehog polypeptides facilitates the 

*m generation of assay systems which can be used to screen for drugs, such as small organic 

fu molecules, which are either agonists or antagonists of the normal cellular function of a hedgehog 

15 and/or patched protein, particularly their role in the pathogenesis of peripheral nerve proliferation 

gS and/or differentiation. In one embodiment, the assay evaluates the ability of a compound to 

4- modulate binding between a hedgehog polypeptide and a hedgehog receptor such as patched. In 

y other embodiments, the assay merely scores for the ability of a test compound to alter the signal 

U transduction acitity of the patched protein. In this manner, a variety of hedgehog and/or ptc 

j~ 20 therapeutics, both agonists (inhibit T-cell maturation) and antagonist (promote T-cell maturation) 

q can be identified. A variety of assay formats will suffice and, in light of the present disclosure, 

Q will be comprehended by skilled artisan. 

In many drug screening programs which test libraries of compounds and natural extracts, 
high throughput assays are desirable in order to maximize the number of compounds surveyed in 

25 a given period of time. Assays which are performed in cell-free systems, such as may be derived 
with purified or semi-purified proteins, are often preferred as "primary" screens in that they can 
be generated to permit rapid development and relatively easy detection of an alteration in a 
molecular target which is mediated by a test compound. Moreover, the effects of cellular 
toxicity and/or bioavailability of the test compound can be generally ignored in the in vitro 

30 system, the assay instead being focused primarily on the effect of the drug on the molecular 
target as may be manifest in an alteration of binding affinity with receptor proteins. 

Acordingly, in an exemplary screening assay for ptc therapeutics, the compound of 
interest is contacted with a mixture including a hedgehog receptor protein (e.g., a cell expressing 
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the patched receptor) and a hedgehog protein under conditions in which it is ordinarily capable 
of binding the hedgehog protein. To the mixture is then added a composition containing a test 
compound. Detection and quantification of receptor/ hedgehog complexes provides a means for 
determining the test compound's efficacy at inhibiting (or potentiating) complex formation 
5 between the receptor protein and the hedgehog polypeptide. The efficacy of the compound can 
be assessed by generating dose response curves from data obtained using various concentrations 
of the test compound. Moreover, a control assay can also be performed to provide a baseline for 
comparison. In the control assay, isolated and purified hedgehog polypeptide is added to the 
receptor protein, and the formation of receptor/ hedgehog complex is quantitated in the absence 
10 of the test compound. 

In other embodiments, a ptc therapeutic of the present invention is one which disrupts the 
association of patched with smoothened. 

Agonist and antagonists of peripheral nerve maintanence can be distinguished, and the 
efficacy of the compound can be assessed, by subsequent testing with T lymphocytes, e.g., in 
fU 15 culture. 



*M In an illustrative embodiment, the polypeptide utilized as a hedgehog receptor can be 

J* generated from the patched protein. Accordingly, an exemplary screening assay includes all or a 
suitable portion of the patched protein which can be obtained from, for example, the human 
f7 patched gene (GenBank U43148) or other vertebrate sources (see GenBank Accession numbers 
FU 20 U40074 for chicken patched and U46155 for mouse patched), as well as from drosophila 
(GenBank Accession number M28999) or other invertebrate sources. The patched protein can 
be provided in the screening assay as a whole protein (preferably expressed on the surface of a 
cell), or alternatively as a fragment of the full length protein which binds to hedgehog 
polypeptides, e.g., as one or both of the substantial extracellular domains (e.g. corresponding to 
25 residues Asnl20-Ser438 and/or Arg770-Trpl027 of the human patched protein - which are also 
potential antagonists of hedgehog-dependent signal transduction). For instance, the patched 
protein can be provided in soluble form, as for example a preparation of one of the extracellular 
domains, or a preparation of both of the extracellular domains which are covalently connected by 
an unstructured linker (see, for example, Huston et al. (1988) PNAS 85:4879; and U.S. Patent 
30 No. 5,091,513). In other embodiments, the protein can be provided as part of a liposomal 
preparation or expressed on the surface of a cell. The patched protein can derived from a 
recombinant gene, e.g., being ectopically expressed in a heterologous cell. For instance, the 
protein can be expressed on oocytes, mammalian cells (e.g., COS, CHO, 3T3 or the like), or 
yeast cell by standard recombinant DNA techniques. These recombinant cells can be used for 



-42- 



ONV-080.01 



receptor binding, signal transduction or gene expression assays. Marigo et al. (1996) 
Development 122:1225-1233 illustrates a binding assay of human hedgehog to chick patched 
protein ectopically expressed in Xenopus laevis oocytes. The assay system of Marigo et al can 
be adapted to the present drug screening assays. As illustrated in that reference, Shh binds to the 
5 patched protein in a selective, saturable, dose-dependent mariner, thus demonstrating that 
patched is a receptor for Shh. 

Complex formation between the hedgehog polypeptide and a hedgehog receptor may be 
detected by a variety of techniques. For instance, modulation of the formation of complexes can 
be quantitated using, for example, detectably labelled proteins such as radiolabeled, 
10 fluorescently labelled, or enzymatically labelled hedgehog polypeptides, by immunoassay, or by 
chromatographic detection. 

Typically, for cell-free assays, it will be desirable to immobilize either the hedgehog 
receptor or the hedgehog polypeptide to facilitate separation of receptor/ hedgehog complexes 
-M from uncomplexed forms of one of the proteins, as well as to accommodate automation of the 
15 assay. In one embodiment, a fusion protein can be provided which adds a domain that allows the 
\Q protein to be bound to a matrix. For example, glutathione-S-transferase/receptor (GST/receptor) 
fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, 
MO) or glutathione derivatized microtitre plates, which are then combined with the hedgehog 
polypeptide, e.g. an 35 S-labeled hedgehog polypeptide, and the test compound and incubated 
jTJ 20 under conditions conducive to complex formation, e.g. at physiological conditions for salt and 
00 pH, though slightly more stringent conditions may be desired. Following incubation, the beads 
~ are washed to remove any unbound hedgehog polypeptide, and the matrix bead-bound radiolabel 
determined directly (e.g. beads placed in scintillant), or in the supernatant after the 
receptor/ hedgehog complexes are dissociated. Alternatively, the complexes can be dissociated 
25 from the bead, separated by SDS-PAGE gel, and the level of hedgehog polypeptide found in the 
bead fraction quantitated from the gel using standard electrophoretic techniques. 

Other techniques for immobilizing proteins on matrices are also available for use in the 
subject assay. For instance, soluble portions of the hedgehog receptor protein can be 
immobilized utilizing conjugation of biotin and streptavidin. For instance, biotinylated receptor 
30 molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques well 
known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and immobilized in the 
wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive 
with the hedgehog receptor but which do not interfere with hedgehog binding can be derivatized 
to the wells of the plate, and the receptor trapped in the wells by antibody conjugation. As 
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above, preparations of a hedgehog polypeptide and a test compound are incubated in the 
receptor-presenting wells of the plate, and the amount of receptor//**? dgehog complex trapped in 
the well can be quantitated. Exemplary methods for detecting such complexes, in addition to 
those described above for the GST-immobilized complexes, include immunodetection of 
5 complexes using antibodies reactive with the hedgehog polypeptide, or which are reactive with 
the receptor protein and compete for binding with the hedgehog polypeptide; as well as enzyme- 
linked assays which rely on detecting an enzymatic activity associated with the hedgehog 
polypeptide. In the instance of the latter, the enzyme can be chemically conjugated or provided 
as a fusion protein with the hedgehog polypeptide. To illustrate, the hedgehog polypeptide can 
10 be chemically cross-linked or genetically fused with alkaline phosphatase, and the amount of 
hedgehog polypeptide trapped in the complex can be assessed with a chromogenic substrate of 
the enzyme, e.g. paranitrophenylphosphate. Likewise, a fusion protein comprising the hedgehog 
polypeptide and glutathione-S-transferase can be provided, and complex formation quantitated 

% by detecting the GST activity using l-chloro-2,4-dinitrobenzene (Habig et al (1974) J Biol Chem 

%j 15 249:7130). 

4* For processes which rely on immunodetection for quantitating one of the proteins trapped 

^ in the complex, antibodies against the protein, such as the anti-hedgehog antibodies described 

y = 

4* herein, can be used. Alternatively, the protein to be detected in the complex can be "epitope 

* tagged" in the form of a fusion protein which includes, in addition to the hedgehog polypeptide 

20 or hedgehog receptor sequence, a second polypeptide for which antibodies are readily available 

Iw 5 (e.g. from commercial sources). For instance, the GST fusion proteins described above can also 

q be used for quantification of binding using antibodies against the GST moiety. Other useful 

□ epitope tags include myc-epitopes (e.g., see Ellison et al. (1991) J Biol Chem 266:21 150-21 157) 
which includes a 10-residue sequence from c-myc, as well as the pFLAG system (International 

25 Biotechnologies, Inc.) or the pEZZ-protein A system (Pharamacia, NJ). 

Where the desired portion of the hedgehog receptor (or other hedgehog binding 
molecule) cannot be provided in soluble form, liposomal vesicles can be used to provide 
manipulatable and isolatable sources of the receptor. For example, both authentic and 
recombinant forms of the patched protein can be reconstituted in artificial lipid vesicles (e.g. 
30 phosphatidylcholine liposomes) or in cell membrane-derived vesicles (see, for example, Bear et 
al. (1992) Cell 68:809-818; Newton et al. (1983) Biochemistry 22:6110-6117; and Reber et al. 
(mi)JBiol Chem 262:11369-11374). 

In addition to cell-free assays, such as described above, the readily available source of 
hedgehog proteins provided by the art also facilitates the generation of cell-based assays for 
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identifying small molecule agonists/antagonists and the like. Analogous to the cell-based assays 
described above for screening combinatorial libraries, cells which are sensitive to hedgehog 
induction, e.g. /?a/c/ze</-expressing cells or other myoblast-derived cells sensitive to hedgehog 
induction, can be contacted with a hedgehog protein and a test agent of interest, with the assay 
scoring for anything from simple binding to the cell to modulation in hedgehog inductive 
responses by the target cell in the presence and absence of the test agent. As with the cell-free 
assays, agents which produce a statistically significant change in hedgehog activities (either 
inhibition or potentiation) can be identified. 

In other emdodiments, the cell-based assay scores for agents which disrupt association of 
patched and smoothened proteins, e.g., in the cell surface membrane or liposomal preparation. 

In addition to characterizing cells that naturally express the patched protein, cells which 
have been genetically engineered to ectopically express patched can be utilized for drug 
screening assays. As an example, cells which either express low levels or lack expression of the 
patched protein, e.g. Xenopus laevis oocytes, COS cells or yeast cells, can be genetically 
modified using standard techniques to ectopically express the patched protein, (see Marigo et 
al., supra). 

The resulting recombinant cells, e.g., which express a functional patched receptor, can be 
utilized in receptor binding assays to identify agonist or anatagonsts of hedgehog binding. 
Binding assays can be performed using whole cells. Furthermore, the recombinant cells of f the 
present invention can be engineered to include other heterolgous genes encoding proteins 
involved in hedgehog-dependent siganl pathways. For example, the gene products of one or 
more of smoothened, costal-2 and/or fused can be co-expressed with patched in the reagent cell, 
with assays being sensitive to the functional reconstituion of the hedgehog signal transduction 
cascade. 

Alternatively, liposomal preparations using reconstituted patched protein can be utilized. 
Patched protein purified from detergent extracts from both authentic and recombinant origins 
can be reconstituted in in artificial lipid vesicles (e.g. phosphatidylcholine liposomes) or in cell 
membrane-derived vesicles (see, for example, Bear et al. (1992) Cell 68:809-818; Newton et al. 
(1983) Biochemistry 22:61 10-6117; and Reber et al (1987) J Biol Chem 262: 1 1369-1 1374). The 
lamellar structure and size of the resulting liposomes can be characterized using electron 
microscopy. External orientation of the patched protein in the reconstituted membranes can be 
demonstrated, for example, by immunoelectron microscopy. The hedgehog protein binding 
activity of liposomes containing patched and liposomes without the protein in the presence of 
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candidate agents can be compared in order to identify potential modulators of the hedgehog- 
patched interaction. 

The hedgehog protein used in these cell-based assays can be provided as a purified source 
(natural or recombinant in origin), or in the form of cells/tissue which express the protein and 
5 which are co-cultured with the target cells. As in the cell-free assays, where simple binding 
(rather than induction) is the hedgehog activity scored for in the assay, the protein can be labelled 
by any of the above-mentioned techniques, e.g., fluorescently, enzymatically or radioactively, or 
detected by immunoassay. 

In addition to binding studies, functional assays can be used to identified modulators, i.e., 
10 agonists or antagonists, of hedgehog or patched activities. By detecting changes in intracellular 
signals, such as alterations in second messengers or gene expression, in /?a/c/ie</-expressing cells 
contacted with a test agent, candidate agonists and antagonists to patched signaling can be 
P identified. 

£V A number of gene products have been implicated in patched-mediated signal 

!TJ 15 transduction, including patched, the transcription factor cubitus interruptus (ci), the 
serine/threonine kinase fused (fu) and the gene products of costal-2, smoothened and suppressor 
EP of fused. 

a The interaction of a hedgehog protein with patched sets in motion a cascade involving the 

: f[ activation and inhibition of downstream effectors, the ultimate consequence of which is, in some 
hi 20 instances, a detectable change in the transcription or translation of a gene. Potential 
S3 transcriptional targets of patched signaling are the patched gene itself (Hidalgo and Ingham, 
1990 Development 110, 291-301; Marigo et al., 1996 ) and the vertebrate homologs of the 
drosophila cubitus interruptus gene, the GLI genes (Hui et al. (1994) Dev Biol 162:402-413). 
Patched gene expression has been shown to be induced in cells of the limb bud and the neural 
25 plate that are responsive to Shh. (Marigo et al. (1996) PNAS, in press; Marigo et al. (1996) 
Development 122:1225-1233). The GLI genes encode putative transcription factors having zinc 
finger DNA binding domains (Orenic et al. (1990) Genes & Dev 4:1053-1067; Kinzler et al. 
(1990) Mol Cell Biol 10:634-642). Transcription of the GLI gene has been reported to be 
upregulated in response to hedgehog in limb buds, while transcription of the GLI3 gene is 
30 downregulated in response to hedgehog induction (Marigo et al. (1996) Development 122:1225- 
1233). By selecting transcriptional regulatory sequences from such target genes, e.g. from 
patched or GLI genes, that are responsible for the up- or down regulation of these genes in 
response to patched signalling, and operatively linking such promoters to a reporter gene, one 
can derive a transcription based assay which is sensitive to the ability of a specific test compound 
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to modify patched signalling pathways. Expression of the reporter gene, thus, provides a valuable 
screening tool for the development of compounds that act as agonists or antagonists of ptc 
induction of differentiation/quiescence. 

Reporter gene based assays of this invention measure the end stage of the above 
5 described cascade of events, e.g., transcriptional modulation. Accordingly, in practicing one 
embodiment of the assay, a reporter gene construct is inserted into the reagent cell in order to 
generate a detection signal dependent on ptc signaling. To identify potential regulatory elements 
responsive to ptc signaling present in the transcriptional regulatory sequence of a target gene, 
nested deletions of genomic clones of the target gene can be constructed using standard 
10 techniques. See, for example, Current Protocols in Molecular Biology, Ausubel, F.M. et al. 
(eds.) Greene Publishing Associates, (1989); U.S. Patent 5,266,488; Sato et al. (1995) J Biol 
Chem 270:10314-10322; and Kube et al. (1995) Cytokine 7:1-7. A nested set of DNA fragments 
from the gene's 5 f -flanking region are placed upstream of a reporter gene, such as the luciferase 
gene, and assayed for their ability to direct reporter gene expression in patched expressing cells. 
^ 15 Host cells transiently transfected with reporter gene constructs can be scored for the induction of 
£ expression of the reporter gene in the presence and absence of hedgehog to determine regulatory 
^ sequences which are responsice to patched-dependcnt signalling. 

4- In practicing one embodiment of the assay, a reporter gene construct is inserted into the 

j\ reagent cell in order to generate a detection signal dependent on second messengers generated by 
u* 20 induction with hedgehog protein. Typically, the reporter gene construct will include a reporter 
tjf gene in operative linkage with one or more transcriptional regulatory elements responsive to the 
hedgehog activity, with the level of expression of the reporter gene providing the hedgehog- 
dependent detection signal. The amount of transcription from the reporter gene may be 
measured using any method known to those of skill in the art to be suitable. For example, mRNA 
25 expression from the reporter gene may be detected using RNAse protection or RNA-based PCR, 
or the protein product of the reporter gene may be identified by a characteristic stain or an 
intrinsic activity. The amount of expression from the reporter gene is then compared to the 
amount of expression in either the same cell in the absence of the test compound (or hedgehog) 
or it may be compared with the amount of transcription in a substantially identical cell that lacks 
30 the target receptor protein. Any statistically or otherwise significant difference in the amount of 
transcription indicates that the test compound has in some manner altered the signal transduction 
of the patched protein, e.g., the test compound is a potential ptc therapeutic. 

As described in further detail below, in preferred embodiments the gene product of the 
reporter is detected by an intrinsic activity associated with that product. For instance, the 
35 reporter gene may encode a gene product that, by enzymatic activity, gives rise to a detection 
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signal based on color, fluorescence, or luminescence. In other preferred embodiments, the 
reporter or marker gene provides a selective growth advantage, e.g., the reporter gene may 
enhance cell viability, relieve a cell nutritional requirement, and/or provide resistance to a drug. 

Preferred reporter genes are those that are readily detectable. The reporter gene may also 
be included in the construct in the form of a fusion gene with a gene that includes desired 
transcriptional regulatory sequences or exhibits other desirable properties. Examples of reporter 
genes include, but are not limited to CAT (chloramphenicol acetyl transferase) (Alton and 
Vapnek (1979), Nature 282: 864-869) luciferase, and other enzyme detection systems, such as 
beta-galactosidase; firefly luciferase (deWet et al. (1987), Mol. Cell. Biol. 7:725-737); bacterial 
luciferase (Engebrecht and Silverman (1984), PNAS 1: 4154-4158; Baldwin et al (1984), 
Biochemistry 23: 3663-3667); alkaline phosphatase (Toh et al. (1989) Eur. J. Biochem. 182: 
231-238, Hall et al. (1983) J. Mol. Appl. Gen. 2: 101), human placental secreted alkaline 
phosphatase (Cullen and Malim (1992) Methods in Enzymol. 216:362-368). 

Transcriptional control elements which may be included in a reporter gene construct 
include, but are not limited to, promoters, enhancers, and repressor and activator binding sites. 
Suitable transcriptional regulatory elements may be derived from the transcriptional regulatory 
regions of genes whose expression is induced after modulation of a patched signal transduction 
pathway. The characteristics of preferred genes from which the transcriptional control elements 
are derived include, but are not limited to, low or undetectable expression in quiescent cells, 
rapid induction at the transcriptional level within minutes of extracellular simulation, induction 
that is transient and independent of new protein synthesis, subsequent shut-off of transcription 
requires new protein synthesis, and mRNAs transcribed from these genes have a short half-life. It 
is not necessary for all of these properties to be present. 

In yet other embodiments, second messenger generation can be measured directly in the 
detection step, such as mobilization of intracellular calcium, phospholipid metabolism or 
adenylate cyclase activity are quantitated, for instance, the products of phospholipid hydrolysis 
IP3, DAG or cAMP could be measured For example, recent studies have implicated protein 

kinase A (PKA) as a possible component of hedgehog/patched signaling (Hammerschmidt et al. 
(1996) Genes & Dev 10:647). High PKA activity has been shown to antagonize hedgehog 
signaling in these systems. Although it is unclear whether PKA acts directly downstream or in 
parallel with hedgehog signaling, it is possible that hedgehog signalling occurs via inhibition of 
PKA activity. Thus, detection of PKA activity provides a potential readout for the instant assays. 

In a preferred embodiment, the ptc therapeutic is a PKA inhibitor. A variety of PKA 
inhibitors are known in the art, including both peptidyl and organic compounds. For instance, 
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the pic therapeutic can be a 5-isoquinolinesulfonamide, such as represented in the general 
formula: 




I 

0=S-0 




wherein, 

R] and R 2 each can independently represent hydrogen, and as valence and stability permit 
a lower alkyl, a lower alkenyl, a lower alkynyl, a carbonyl (such as a carboxyl, an ester, a 
formate, or a ketone), a thiocarbonyl (such as a thioester, a thioacetate, or a thioformate), an 

amino, an acylamino, an amido, a cyano, a riitro, an azido, a sulfate, a sulfonate, a sulfonamido, 
-(CH 2 ) m -R 8 , -(CH 2 ) m -OH, -(CH 2 ) m -0-lower alkyl, -(CH 2 ) m -0-lower alkenyl, -(CH 2 ) n -0- 
(CH 2 ) m -R 8 , -(CH 2 ) m -SH, -(CH 2 ) m -S-lower alkyl, -(CH 2 ) m -S-lower alkenyl, -(CH 2 VS-(CH 2 ) m - 
R 8 , or 

Rl and R 2 taken together with N form a heterocycle (substituted or unsubstituted); 

R3 is absent or represents one or more substitutions to the isoquinoline ring such as a 
lower alkyl, a lower alkenyl, a lower alkynyl, a carbonyl (such as a carboxyl, an ester, a formate, 
or a ketone), a thiocarbonyl (such as a thioester, a thioacetate, or a thioformate), an amino, an 
acylamino, an amidd, a cyano, a nitro, an azido, a sulfate, a sulfonate, a sulfonamido, -(CH 2 ) m - 
*8> -(CH 2 ) m -OH, -(CH 2 ) m -0-lower alkyl, -(CH 2 ) m -0-lower alkenyl, -(CH 2 ) n -0-(CH 2 ) m -R 8 , 
-(CH 2 ) m -SH, -(CH 2 ) m -S-lower alkyl, -(CH 2 ) m -S-lower alkenyl, -(CH 2 ) n -S-(CH 2 ) m -R 8 ; 

R 8 represents a substituted or unsubstituted aryl, aralkyl, cycloalkyl, cycloalkenyl, or 
heterocycle; and 

n and m are independently for each occurrence zero or an integer in the range of 1 to 6. 

In a preferred embodiment, the PKA inhibitor is N-[2-((p-bromocinnamyl)amino)ethyl]-5- 
isoquinolinesulfonamide (H-89; Calbiochem Cat. No. 371963), e.g., having the formula: 
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In another embodiment, the PKA inhibitor is l-(5-isoquinolinesulfonyl)-2-methylpiperazine (H- 
7; Calbiochem Cat. No. 371955), e.g., having the formula: 




5 In still other embodiments, the PKA inhibitor is KT5720 (Calbiochem Cat. No. 420315), having 
the structure 

OH 

CH 3 (CH 2 ) 4 CH 2 OOC* 1%1, 




A variety of nucleoside analogs are also useful as PKA inhibitors. For example, the subject 
method can be carried out cyclic AMP analogs which inhibit the kinase activity of PKA, as for 
1 0 example, 8-bromo-cAMP or dibutyryl-c AMP 
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Exemplary peptidyl inhibitors of PKA activity include the PKA Heat Stable Inhibitor 
(isoform a; see, for example, Calbiochem Cat. No. 539488, and Wen et al. (1995) J Biol Chem 
270:2041). 

Certain hedehog receptors may stimulate the activity of phospholipases. Inositol lipids 
can be extracted and analyzed using standard lipid extraction techniques. Water soluble 
derivatives of all three inositol lipids (D^, DP 2 , IP3) can also be quantitated using radiolabelling 

techniques or HPLC. 

The mobilization of intracellular calcium or the influx of calcium from outside the cell 
may be a response to hedgehog stimulation or lack there of Calcium flux in the reagent cell can 
be measured using standard techniques. The choice of the appropriate calcium indicator, 
fluorescent, bioluminescent, metallochromic, or Ca^-sensitive microelectrodes depends on the 
cell type and the magnitude and time constant of the event under study (Borle (1990) Environ 
Health Perspect 84:45-56). As an exemplary method of Ca"^ detection, cells could be loaded 
with the Ca^sensitive fluorescent dye fiira-2 or indo-1, using standard methods, and any change 
in Ca"^ measured using a fluorometer. 

In certain embodiments of the assay, it may be desirable to screen for changes in cellular 
phosphorylation. As an example, the drosophila gene fused (fu) which encodes a 
serine/threonine kinase has been identified as a potential downstream target in hedgehog 
signaling. (Preat et al., 1990 Nature 347, 87-89; Therond et al. 1993, Meek Dev. 44. 65-80). 
The ability of compounds to modulate serine/threonine kinase activation could be screened using 
colony immunoblotting (Lyons and Nelson (1984) Proc. Natl Acad. Sci. USA 81:7426-7430) 
using antibodies against phosphorylated serine or threonine residues. Reagents for performing 
such assays are commercially available, for example, phosphoserine and phosphothreonine 
specific antibodies which measure increases in phosphorylation of those residues can be 
purchased from comercial sources. 
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In yet another embodiment, the ptc therapeutic is an antisense molecule which inhibits 
expression of a protein involved in a /?a/c/*e<i-mediated signal transduction pathway. To 
illustrate, by inhibiting the expression of a protein which are involved in patched signals, such as 
fused, costal-2, smoothened and/or Gli genes, the ability of the patched signal pathway(s) to 
inhibit proliferation of a cell can be altered, e.g., potentiated or repressed. 

As used herein, "antisense" therapy refers to administration or in situ generation of 
oligonucleotide probes or their derivatives which specifically hybridize (e.g. bind) under cellular 
conditions with cellular mRNA and/or genomic DNA encoding a hedgehog protein, patched, or a 
protein involved in patched-mediated signal transduction. The hybridization should inhibit 
expression of that protein, e.g. by inhibiting transcription and/or translation. The binding may be 
by conventional base pair complementarity, or, for example, in the case of binding to DNA 
duplexes, through specific interactions in the major groove of the double helix. In general, 
"antisense" therapy refers to the range of techniques generally employed in the art, and includes 
any therapy which relies on specific binding to oligonucleotide sequences. 

An antisense construct of the present invention can be delivered, for example, as an 
expression plasmid which, when transcribed in the cell, produces RNA which is complementary 
to at least a unique portion of the target cellular mRNA. Alternatively, the antisense construct is 
an oligonucleotide probe which is generated ex vivo and which, when introduced into the cell 
causes inhibition of expression by hybridizing with the mRNA and/or genomic sequences of a 
target gene. Such oligonucleotide probes are preferably modified oligonucleotide which are 
resistant to endogenous nucleases, e.g. exonucleases and/or endonucleases, and is therefore 
stable in vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides are 
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S. Patents 
5,176,996; 5,264,564; and 5,256,775). Additionally, general approaches to constructing 
oligomers useful in antisense therapy have been reviewed, for example, by Van der Krol et al. 
(1988) Biotechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659-2668. 

Several considerations should be taken into account when constructing antisense 
oligonucleotides for the use in the methods of the invention: (1) oligos should have a GC 
content of 50% or more; (2) avoid sequences with stretches of 3 or more G's; and (3) 
oligonucleotides should not be longer than 25-26 mers. When testing an antisense 
oligonucleotide, a mismatched control can be constructed. The controls can be generated by 
reversing the sequence order of the corresponding antisense oligonucleotide in order to conserve 
the same ratio of bases. 
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In an illustrative 
inhibiting the expression 
Exemplary antisense consl 



embodiment, the ptc therapeutic can be an antisense construct for 
of patched, e.g., to mimic the inhibition of patched by hedgehog. 
cts include: 



5 '-GTCCTGGCGCCGCCGCCGCCGTCGCC 
5 '-TTCCGATGACCGGCCTTTCGCGGTGA 
5 '-GTGCACGGAAAGGTGCAGGCCAC ACT 



VL Exemplary pharmaceutical preparations of hedgehog and ptc therapeutics 

The source of the hedgehog and ptc therapeutics to be formulated will depend on the 
particular form of the agent. Small organic molecules and peptidyl fragments can be chemically 
synthesized and provided in a pure form suitable for pharmaceutical/cosmetic usage. Products of 
natural extracts can be purified according to techniques known in the art. For example, the Cox 
et al. U.S. Patent 5,286,654 describes a method for purifying naturally occurring forms of a 
secreted protein and can be adapted for purification of hedgehog polypeptides. Recombinant 
sources of hedgehog polypeptides are also available. For example, the gene encoding hedgehog 
polypeptides, are known, inter alia, from PCT publications WO 95/18856 and WO 96/17924. 

Those of skill in treating peripheral neuropathies can determine the effective amount of 
an hedgehog or ptc therapeutic to be formulated in a pharmaceutical or cosmetic preparation. 

The hedgehog or ptc therapeutic formulations used in the method of the invention are 
most preferably applied in the form of appropriate compositions. As appropriate compositions 
there may be cited all compositions usually employed for systemically or topically administering 
drugs. The pharmaceutically acceptable carrier should be substantially inert, so as not to act with 
the active component. Suitable inert carriers include water, alcohol polyethylene glycol, mineral 
oil or petroleum gel, propylene glycol and the like. 

To prepare the pharmaceutical compositions of this invention, an effective amount of the 
particular hedgehog or ptc therapeutic as the active ingredient is combined in intimate admixture 
with a pharmaceutically acceptable carrier, which carrier may take a wide variety of forms 
depending on the form of preparation desired for administration. These pharmaceutical 
compositions are desirable in unitary dosage form suitable, particularly, for administration orally, 
rectally, percutaneously, or by parenteral injection. For example, in preparing the compositions in 
oral dosage form, any of the usual pharmaceutical media may be employed such as, for example, 
water, glycols, oils, alcohols and the like in the case of oral liquid preparations such as 
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suspensions, syrups, elixirs and solutions; or solid carriers such as starches, sugars, kaolin, 
lubricants, binders, disintegrating agents and the like in the case of powders, pills, capsules, and 
tablets. Because of their ease in administration, tablets and capsules represents the most 
advantageous oral dosage unit form, in which case solid pharmaceutical carriers are obviously 
5 employed. For parenteral compositions, the carrier will usually comprise sterile water, at least in 
large part, though other ingredients, for example, to aid solubility, may be included. Injectable 
solutions, for example, may be prepared in which the carrier comprises saline solution, glucose 
solution or a mixture of saline and glucose solution. Injectable suspensions may also be prepared 
in which case appropriate liquid carriers, suspending agents and the like may be employed. Also 
10 included are solid form preparations which are intended to be converted, shortly before use, to 
liquid form preparations. In the compositons suitable for percutaneous administration, the carrier 
optionally comprises a penetration enhancing agent and/or a suitable wetting agent, optionally 
combined with suitable additives of any nature in minor proportions, which additives do not 
introduce a significant deleterious effect on the skin. 

15 In addition to the direct topical application of the preparations they can be topically 

administered by other methods, for example, encapsulated in a temperature and/or pressure 
sensitive matrix or in film or solid carrier which is soluble in body fluids and the like for 
subsequent release, preferably sustained-release of the active component. 

As appropriate compositions for topical application there may be cited all compositions 
20 usually employed for topically administering therapeuitcs, e.g., creams, gellies, dressings, 
shampoos, tinctures, pastes, ointments, salves, powders, liquid or semiliquid formulation and the 
like. Application of said compositions may be by aerosol e.g. with a propellent such as nitrogen 
carbon dioxide, a freon, or without a propellent such as a pump spray, drops, lotions, or a 
semisolid such as a thickened composition which can be applied by a swab. In particular 
25 compositions, semisolid compositions such as salves, creams, pastes, gellies, ointments and the 
like will conveniently be used. 

It is especially advantageous to formulate the subject compositions in dosage unit form 
for ease of administration and uniformity of dosage. Dosage unit form as used in the specification 
and claims herein refers to physically discreate units suitable as unitary dosages, each unit 
30 containing a predetermined quantity of active ingredient calculated to produce the desired 
therapeutic effect in association with the required pharmaceutical carrier. Examples of such 
dosage unit forms are tablets (including scored or coated tablets), capsules, pills, powders 
packets, wafers, injectable solutions or suspensions, teaspoonfuls, tablespoonfuls and the like, 
and segregated multiples thereof. 
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The pharmaceutical preparations of the present invention can be used, as stated above, for 
the many applications whcih can be considered cosmetic uses. Cosmetic compositions known in 
the art, preferably hypoallergic and pH controlled are especially preferred, and include toilet 
waters, packs, lotions, skin milks or milky lotions. The preparations contain, besides the 
5 hedgehog or ptc therapeutic, components usually employed in such preparations. Examples of 
such components are oils, fats, waxes, surfactants, humectants, thickening agents, antioxidants, 
viscosity stabilizers, chelating agents, buffers, preservatives, perfumes, dyestuffs, lower alkanols, 
and the like. If desired, further ingredients may be incorporated in the compositions, e.g. 
antiinflammatory agents, antibacterials, antifungals, disinfectants, vitamins, sunscreens, 
1 0 antibiotics, or other anti-acne agents. 

Examples of oils comprise fats and oils such as olive oil and hydrogenated oils; waxes 
such as beeswax and lanolin; hydrocarbons such as liquid paraffin, ceresin, and squalane; fatty 
q acids such as stearic acid and oleic acid; alcohols such as cetyl alcohol, stearyl alcohol, lanolin 
J3 alcohol, and hexadecanol; and esters such as isopropyl myristate, isopropyl palmitate and butyl 
^15 stearate. As examples of surfactants there may be cited anionic surfactants such as sodium 
stearate, sodium cetylsulfate, polyoxyethylene laurylether phosphate, sodium N-acyl glutamate; 
cationic surfactants such as stearyldimethylbenzylammonium chloride and 
B g stearyltrimethylammonium chloride; ampholytic surfactants such as alkylaminoethylglycine 
f hydrocloride solutions and lecithin; and nonionic surfactants such as glycerin monostearate, 

L 20 sorbitan monostearate, sucrose fatty acid esters, propylene glycol monostearate, polyoxyethylene 
fy oleylether, polyethylene glycol monostearate, polyoxyethylene sorbitan monopalmitate, 
polyoxyethylene coconut fatty acid monoethanolamide, polyoxypropylene glycol (e.g. the 
q materials sold under the trademark "Pluronic"), polyoxyethylene castor oil, and polyoxyethylene 
lanolin. Examples of humectants include glycerin, 1,3-butylene glycol, and propylene glycol; 
25 examples of lower alcohols include ethanol and isopropanol; examples of thickening agents 
include xanthan gum, hydroxypropyl cellulose, hydroxypropyl methyl cellulose, polyethylene 
glycol and sodium carboxymethyl cellulose; examples of antioxidants comprise butylated 
hydroxytoluene, butylated hydroxyanisole, propyl gallate, citric acid and ethoxyquin; examples 
of chelating agents include disodium edetate and ethanehydroxy diphosphate; examples of 
30 buffers comprise citric acid, sodium citrate, boric acid, borax, and disodium hydrogen phosphate; 
and examples of preservatives are methyl parahydroxybenzoate, ethyl parahydroxybenzoate, 
dehydroacetic acid, salicylic acid and benzoic acid. 

For preparing ointments, creams, toilet waters, skin milks, and the like, typically from 
0.01 to 10% in particular from 0.1 to 5% and more in particular from 0.2 to 2.5% of the active 
35 ingredient, e.g., of the hedgehog or ptc therapeutic, will be incorporated in the compositions. In 
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ointments or creams, the carrier for example consists of 1 to 20%, in particular 5 to 15% of a 
humectant, 0.1 to 10% in particular from 0.5 to 5% of a thickener and water; or said carrier may 
consist of 70 to 99%, in particular 20 to 95% of a surfactant, and 0 to 20%, in particular 2.5 to 
15% of a fat; or 80 to 99.9% in particular 90 to 99% of a thickener; or 5 to 15% of a surfactant, 
5 2-15% of a humectant, 0 to 80% of an oil, very small ( < 2%) amounts of preservative, coloring 
agent and/or perfume, and water. In a toilet water, the carrier for example consists of 2 to 10% of 
a lower alcohol, 0.1 to 10% or in particular 0.5 to 1% of a surfactant, 1 to 20%, in particular 3 to 
7% of a humectant, 0 to 5% of a buffer, water and small amounts ( < 2%) of preservative, 
dyestuff and/or perfume. In a skin milk, the carrier typically consists of 10-50% of oil, 1 to 10% 
10 of surfactant, 50-80% of water and 0 to 3% of preservative and/or perfume, hi the 
aforementioned preparations, all % symbols refer to weight by weight percentage. 

Particular compositions for use in the method of the present invention are those wherein 
^ the hedgehog or ptc therapeutic is formulated in liposome-containing compositions. Liposomes 
*Jj are artificial vesicles formed by amphiphatic molecules such as polar lipids, for example, 
^15 phosphatidyl cholines, ethanolamines and serines, sphingomyelins, cardiolipins, plasmalogens, 
jp phosphatidic acids and cerebiosides. Liposomes are formed when suitable amphiphathic 
^ molecules are allowed to swell in water or aqueous solutions to form liquid crystals usually of 

1 5 a 

jg multilayer structure comprised of many bilayers separated from each other by aqueous material 
« (also referred to as coarse liposomes). Another type of liposome known to be consisting of a 

20 single bilayer encapsulating aqueous material is referred to as a unilamellar vesicle. If water- 
fU soluble materials are included in the aqueous phase during the swelling of the lipids they become 
if entrapped in the aqueous layer between the lipid bilayers. 

^ Water-soluble active ingredients such as, for example, various salt forms of a hedgehog 

polypeptide, are encapsulated in the aqueous spaces between the molecular layers. The lipid 

25 soluble active ingredient of hedgehog or ptc therapeutic, such as an organic mimetic, is 
predominantly incorporated into the lipid layers, although polar head groups may protude from 
the layer into the aqueous space. The encapsulation of these compounds can be achieved by a 
number of methods. The method most commonly used involves casting a thin film of 
phospholipid onto the walls of a flask by evaporation from an organic solvent. When this film is 

30 dispersed in a suitable aqueous medium, multilamellar liposomes are formed. Upon suitable 
sonication, the coarse liposomes form smaller similarly closed vesicles. 

Water-soluble active ingredients are usually incorporated by dispersing the cast film with 
an aqueous solution of the compound. The unencapsulated compound is then removed by 
centrifugation, chromatography, dialysis or other art-known suitable procedures. The lipid- 
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soluble active ingredient is usually incorporated by dissolving it in the organic solvent with the 
phospholipid prior to casting the film. If the solubility of the material in the lipid phase is not 
exceeded or the amount present is not in excess of that which can be bound to the lipid, 
liposomes prepared by the above method usually contain most of the material bound in the lipid 
5 bilayers; separation of the liposomes from unencapsulated material is not required. 

A particularly convenient method for preparing liposome formulated forms of hedgehog 
and ptc therapeutics is the method described in EP-A-253,619, incorporated herein by reference. 
In this method, single bilayered liposomes containing encapsulated active ingredients are 
prepared by dissolving the lipid component in an organic medium, injecting the organic solution 
10 of the lipid component under pressure into an aqueous component while simultaneously mixing 
the organic and aqueous components with a high speed homogenizer or mixing means, 
whereupon the liposomes are formed spontaneously. 

□ The single bilayered liposomes containing the encapsulated hedgehog or ptc therapeutic 

can be employed directly or they can be employed in a suitable pharmaceutically acceptable 
|ij 15 carrier for topical administration. The viscosity of the liposomes can be increased by the addition 
IT of one or more suitable thickening agents such as, for example xanthan gum, hydroxypropyl 
[rj cellulose, hydroxypropyl methylcellulose and mixtures thereof. The aqueous component may 
+= consist of water alone or it may contain electrolytes, buffered systems and other ingredients, such 
as, for example, preservatives. Suitable electrolytes which can be employed include metal salts 
r* 20 such as alkali metal and alkaline earth metal salts. The preferred metal salts are calcium chloride, 
sodium chloride and potassium chloride. The concentration of the electrolyte may vary from zero 
to 260 mM, preferably from 5 mM to 160 mM. The aqueous component is placed in a suitable 
vessel which can be adapted to effect homogenization by effecting great turbulence during the 
injection of the organic component. Homogenization of the two components can be 
25 accomplished within the vessel, or, alternatively, the aqueous and organic components may be 
injected separately into a mixing means which is located outside the vessel. In the latter case, the 
liposomes are formed in the mixing means and then transferred to another vessel for collection 
purpose. 

The organic component consists of a suitable non-toxic, pharmaceutically acceptable 
30 solvent such as, for example ethanol, glycerol, propylene glycol and polyethylene glycol, and a 
suitable phospholipid which is soluble in the solvent. Suitable phospholipids which can be 
employed include lecithin, phosphatidylcholine, phosphatydylserine, phosphatidylethanol-amine, 
phosphatidylinositol, lysophosphatidylcholine and phospha-tidyl glycerol, for example. Other 
lipophilic additives may be employed in order to selectively modify the characteristics of the 



lis 
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liposomes. Examples of such other additives include stearylamine, phosphatidic acid, tocopherol, 
cholesterol and lanolin extracts. 

In addition, other ingredients which can prevent oxidation of the phospholipids may be 
added to the organic component. Examples of such other ingredients include tocopherol, 
5 butylated hydroxyanisole, butylated hydroxytoluene, ascorbyl palmitate and ascorbyl oleate. 
Preservatives such a benzoic acid, methyl paraben and propyl paraben may also be added. 

Apart from the above-described compositions, use may be made of covers, e.g. plasters, 
bandages, dressings, gauze pads and the like, containing an appropriate amount of a hedgehog or 
ptc therapeutic. In some cases use may be made of plasters, bandages, dressings, gauze pads and 
10 the like which have been impregnated with a topical formulation containing the therapeutic 
formulation. 

VII Exemplary Hedgehog Antagonists 
i£ Exemplary hedgehog antagonists of the invention include cyclopamine, tomatidine, 

pTI jervine, AY9944, triparanol, forskolin, cAMP, dibutyryl cAMP (and other hydrophobically 
4^ 15 modified cAMP variants) and fictionally effective derivatives thereof. Additional hedgehog 
yl antagonists, functionally effective derivatives, compositions and methods for making such 
J 88 compounds are described in detail in the following U.S. Patent applications: Baxter et al., 

^ 09/663,835; Beachy et al. entitled "Inhibitors of hedgehog signaling pathways, compositions and 

p~ ■ , 

fij uses related thereto" filed October 10, 2000; Baxter et al. "Mediators of hedgehog signaling 
•Sj 20 pathway, compositions and uses related thereto" filed October 13, 2000; Philip Beachy, 
P "Regulators of the hedgehog pathway, compositions and uses related thereto" filed October 13, 
2000. 



25 Evaluation of the effects of hedgehog on T-cell development 

The invention now being generally described, it will be more readily understood by 
reference to the following examples which are included merely for purposes of illustration of 
certain aspects and embodiments of the present invention, and are not intended to limit the 
invention. 

30 Utilizing a fetal thymic organ culture, the effects of Shh on T-cell maturation was 

detemined. Briefly, immature thymocytes (CD4-/CD8- [DN]) were isolated from E14 thymus. 
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The cells were culture for about 7 days, and the level of mature thymocytes, e.g., CD4+/CD8+ 
[DP], was assessed. In certain cultures, octyl-modifed Shh was added to the culture; in other 
cultures, the anti-hedgehog 5E1 antibody was added. As illustrated in Figures 1-4, the addition 
of Shh to the culture resulted in an increase in the percentage of cells after 7 days with markers 
indicative of immature thymocytes. Conversely, addition of the 5E1 antibody resulted in an 
increase in the percentage of mature T lymphocytes after 7 days. 

All of the above-cited references and publications are hereby incorporated by reference. 
Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than routine 
experimentation, numerous equivalents to the specific polypeptides, nucleic acids, methods, 
assays and reagents described herein. Such equivalents are considered to be within the scope of 
this invention. 
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SEQUENCE LISTING 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1275 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATG GTC GAA ATG CTG CTG TTG ACA AGA ATT CTC TTG GTG GGC TTC ATC 
Met Val Glu Met Leu Leu Leu Thr Arg lie Leu Leu Val Gly Phe lie 
1 5 10 15 

TGC GCT CTT TTA GTC TCC TCT GGG CTG ACT TGT GGA CCA GGC AGG GGC 
Cys Ala Leu Leu Val Ser Ser Gly Leu Thr Cys Gly Pro Gly Arg Gly 
20 25 30 

ATT GGA AAA AGG AGG CAC CCC AAA AAG CTG ACC CCG TTA GCC TAT AAG 
lie Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

CAG TTT ATT CCC AAT GTG GCA GAG AAG ACC CTA GGG GCC AGT GGA AGA 
Gin Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg 
50 55 60 

TAT GAA GGG AAG ATC ACA AGA AAC TCC GAG AGA TTT AAA GAA CTA ACC 
Tyr Glu Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr 
65 70 75 80 

CCA AAT TAC AAC CCT GAC ATT ATT TTT AAG GAT GAA GAG AAC ACG GGA 
Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly 
85 90 95 

GCT GAC AGA CTG ATG ACT CAG CGC TGC AAG GAC AAG CTG AAT GCC CTG 
Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu 
100 105 110 

GCG ATC TCG GTG ATG AAC CAG TGG CCC GGG GTG AAG CTG CGG GTG ACC 
Ala He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 

GAG GGC TGG GAC GAG GAT GGC CAT CAC TCC GAG GAA TCG CTG CAC TAC 
Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr 
130 135 140 

GAG GGT CGC GCC GTG GAC ATC ACC ACG TCG GAT CGG GAC CGC AGC AAG 
Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys 
145 150 155 160 
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TAC GGA ATG CTG GCC CGC CTC GCC GTC GAG GCC GGC TTC GAC TGG GTC 52 8 

Tyr Gly Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
165 170 175 

5 

TAC TAC GAG TCC AAG GCG CAC ATC CAC TGC TCC GTC AAA GCA GAA AAC 576 
Tyr Tyr Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn 
180 185 190 

10 TCA GTG GCA GCG AAA TCA GGA GGC TGC TTC CCT GGC TCA GCC ACA GTG 624 
Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val 
195 200 205 

CAC CTG GAG CAT GGA GGC ACC AAG CTG GTG AAG GAC CTG AGC CCT GGG 672 
15 His Leu Glu His Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly 
210 215 220 

GAC CGC GTG CTG GCT GCT GAC GCG GAC GGC CGG CTG CTC TAC AGT GAC 720 
Asp Arg Val Leu Ala Ala Asp Ala Asp Gly Arg Leu Leu Tyr Ser Asp 
20 225 230 235 240 

TTC CTC ACC TTC CTC GAC CGG ATG GAC AGC TCC CGA AAG CTC TTC TAC 768 
Phe Leu Thr Phe Leu Asp Arg Met Asp Ser Ser Arg Lys Leu Phe Tyr 
245 250 255 

25 

GTC ATC GAG ACG CGG CAG CCC CGG GCC CGG CTG CTA CTG ACG GCG GCC 816 
Val lie Glu Thr Arg Gin Pro Arg Ala Arg Leu Leu Leu Thr Ala Ala 
260 265 270 

30 CAC CTG CTC TTT GTG GCC CCC CAG CAC AAC CAG TCG GAG GCC ACA GGG 864 
His Leu Leu Phe Val Ala Pro Gin His Asn Gin Ser Glu Ala Thr Gly 
275 280 285 

TCC ACC AGT GGC CAG GCG CTC TTC GCC AGC AAC GTG AAG CCT GGC CAA 912 
35 Ser Thr Ser Gly Gin Ala Leu Phe Ala Ser Asn Val Lys Pro Gly Gin 
290 295 300 

CGT GTC TAT GTG CTG GGC GAG GGC GGG CAG CAG CTG CTG CCG GCG TCT 960 
Arg Val Tyr Val Leu Gly Glu Gly Gly Gin Gin Leu Leu Pro Ala Ser 
40 305 310 315 320 

GTC CAC AGC GTC TCA TTG CGG GAG GAG GCG TCC GGA GCC TAC GCC CCA 1008 
Val His Ser Val Ser Leu Arg Glu Glu Ala Ser Gly Ala Tyr Ala Pro 
325 330 335 



45 



CTC ACC GCC CAG GGC ACC ATC CTC ATC AAC CGG GTG TTG GCC TCC TGC 1056 
Leu Thr Ala Gin Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys 
340 345 350 



50 TAC GCC GTC ATC GAG GAG CAC AGT TGG GCC CAT TGG GCC TTC GCA CCA 1104 
Tyr Ala Val He Glu Glu His Ser Trp Ala His Trp Ala Phe Ala Pro 
355 360 365 

TTC CGC TTG GCT CAG GGG CTG CTG GCC GCC CTC TGC CCA GAT GGG GCC 1152 
55 Phe Arg Leu Ala Gin Gly Leu Leu Ala Ala Leu Cys Pro Asp Gly Ala 
370 375 380 

ATC CCT ACT GCC GCC ACC ACC ACC ACT GGC ATC CAT TGG TAC TCA CGG 12 00 

He Pro Thr Ala Ala Thr Thr Thr Thr Gly He His Trp Tyr Ser Arg 
60 385 390 395 400 
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CTC CTC TAC CGC ATC GGC AGC TGG GTG CTG GAT GGT GAC GCG CTG CAT 
Leu Leu Tyr Arg lie Gly Ser Trp Val Leu Asp Gly Asp Ala Leu His 
405 410 415 

CCG CTG GGC ATG GTG GCA CCG GCC AGC TG 
Pro Leu Gly Met Val Ala Pro Ala Ser 
420 425 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1. .1191 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ATG GCT CTG CCG GCC AGT CTG TTG CCC CTG TGC TGC TTG GCA CTC TTG 
Met Ala Leu Pro Ala Ser Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
1 5 10 15 

GCA CTA TCT GCC CAG AGC TGC GGG CCG GGC CGA GGA CCG GTT GGC CGG 
Ala Leu Ser Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 25 30 

CGG CGT TAT GTG CGC AAG CAA CTT GTG CCT CTG CTA TAC AAG CAG TTT 
Arg Arg Tyr Val Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

GTG CCC AGT ATG CCC GAG CGG ACC CTG GGC GCG AGT GGG CCA GCG GAG 
Val Pro Ser Met Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

GGG AGG GTA ACA AGG GGG TCG GAG CGC TTC CGG GAC CTC GTA CCC AAC 
Gly Arg Val Thr Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

TAC AAC CCC GAC ATA ATC TTC AAG GAT GAG GAG AAC AGC GGC GCA GAC 
Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 
85 90 95 

CGC CTG ATG ACA GAG CGT TGC AAA GAG CGG GTG AAC GCT CTA GCC ATC 
Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala lie 
100 105 110 

GCG GTG ATG AAC ATG TGG CCC GGA GTA CGC CTA CGT GTG ACT GAA GGC 
Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 



-62- 



ONV-080.01 



TGG GAC GAG GAC GGC CAC CAC GCA CAG GAT TCA CTC CAC TAC GAA GGC 
Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 
130 135 140 

CGT GCC TTG GAC ATC ACC ACG TCT GAC CGT GAC CGT AAT AAG TAT GGT 
Arg Ala Leu Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 150 155 160 

TTG TTG GCG CGC CTA GCT GTG GAA GCC GGA TTC GAC TGG GTC TAC TAC 
Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 

GAG TCC CGC AAC CAC ATC CAC GTA TCG GTC AAA GCT GAT AAC TCA CTG 
Glu Ser Arg Asn His lie His Val Ser Val Lys Ala Asp Asn Ser Leu 
180 185 190 

GCG GTC CGA GCC GGA GGC TGC TTT CCG GGA AAT GCC ACG GTG CGC TTG 
Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr Val Arg Leu 
195 200 205 

CGG AGC GGC GAA CGG AAG GGG CTG AGG GAA CTA CAT CGT GGT GAC TGG 
Arg Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Arg Gly Asp Trp 
210 215 220 

GTA CTG GCC GCT GAT GCA GCG GGC CGA GTG GTA CCC ACG CCA GTG CTG 
Val Leu Ala Ala Asp Ala Ala Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

CTC TTC CTG GAC CGG GAT CTG CAG CGC CGC GCC TCG TTC GTG GCT GTG 
Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Ala Val 
245 250 255 

GAG ACC GAG CGG CCT CCG CGC AAA CTG TTG CTC ACA CCC TGG CAT CTG 
Glu Thr Glu Arg Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
260 265 270 

GTG TTC GCT GCT CGC GGG CCA GCG CCT GCT CCA GGT GAC TTT GCA CCG 
Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

GTG TTC GCG CGC CGC TTA CGT GCT GGC GAC TCG GTG CTG GCT CCC GGC 
Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 300 

GGG GAC GCG CTC CAG CCG GCG CGC GTA GCC CGC GTG GCG CGC GAG GAA 
Gly Asp Ala Leu Gin Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 
305 310 315 320 

GCC GTG GGC GTG TTC GCA CCG CTC ACT GCG CAC GGG ACG CTG CTG GTC 
Ala Val Gly Val Phe Ala Pro Leu Thr Ala His Gly Thr Leu Leu Val 
325 330 335 

AAC GAC GTC CTC GCC TCC TGC TAC GCG GTT CTA GAG AGT CAC CAG TGG 
Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
340 345 350 

GCC CAC CGC GCC TTC GCC CCT TTG CGG CTG CTG CAC GCG CTC GGG GCT 
Ala His Arg Ala Phe Ala Pro Leu Arg Leu Leu His Ala Leu Gly Ala 
355 360 365 
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CTG CTC CCT GGG GGT GCA GTC CAG CCG ACT GGC ATG CAT TGG TAC TCT 
Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 
370 375 380 



1152 



CGC CTC CTT TAC CGC TTG GCC GAG GAG TTA ATG GGC TG 
Arg Leu Leu Tyr Arg Leu Ala Glu Glu Leu Met Gly 
385 390 395 



1190 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1233 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATG TCT CCC GCC TGG CTC CGG CCC CGA CTG CGG TTC TGT CTG TTC CTG 4 8 

Met Ser Pro Ala Trp Leu Arg Pro Arg Leu Arg Phe Cys Leu Phe Leu 
1 5 10 15 

CTG CTG CTG CTT CTG GTG CCG GCG GCG CGG GGC TGC GGG CCG GGC CGG 96 
Leu Leu Leu Leu Leu Val Pro Ala Ala Arg Gly Cys Gly Pro Gly Arg 
20 25 30 

GTG GTG GGC AGC CGC CGG AGG CCG CCT CGC AAG CTC GTG CCT CTT GCC 144 
Val Val Gly Ser Arg Arg Arg Pro Pro Arg Lys Leu Val Pro Leu Ala 
35 40 45 

TAC AAG CAG TTC AGC CCC AAC GTG CCG GAG AAG ACC CTG GGC GCC AGC 192 
Tyr Lys Gin Phe Ser Pro Asn Val Pro Glu Lys Thr Leu Gly Ala Ser 
50 55 60 

GGG CGC TAC GAA GGC AAG ATC GCG CGC AGC TCT GAG CGC TTC AAA GAG 24 0 

Gly Arg Tyr Glu Gly Lys lie Ala Arg Ser Ser Glu Arg Phe Lys Glu 
65 70 75 80 

CTC ACC CCC AAC TAC AAT CCC GAC ATC ATC TTC AAG GAC GAG GAG AAC 2 88 

Leu Thr Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn 
85 90 95 

ACG GGT GCC GAC CGC CTC ATG ACC CAG CGC TGC AAG GAC CGT CTG AAC 336 
Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn 
100 105 110 

TCA CTG GCC ATC TCT GTC ATG AAC CAG TGG CCT GGT GTG AAA CTG CGG 3 84 

Ser Leu Ala He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg 
115 120 125 

GTG ACC GAA GGC CGG GAT GAA GAT GGC CAT CAC TCA GAG GAG TCT TTA 432 
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Val Thr Glu Gly Arg Asp Glu Asp Gly His His Ser Glu Glu Ser Leu 
130 135 140 

CAC TAT GAG GGC CGC GCG GTG GAT ATC ACC ACC TCA GAC CGT GAC CGA 
His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg 
145 150 155 160 

AAT AAG TAT GGA CTG CTG GCG CGC TTA GCA GTG GAG GCC GGC TTC GAC 
Asn Lys Tyr Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp 
165 170 175 

TGG GTG TAT TAC GAG TCC AAG GCC CAC GTG CAT TGC TCT GTC AAG TCT 
Trp Val Tyr Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser 
180 185 190 

GAG CAT TCG GCC GCT GCC AAG ACA GGT GGC TGC TTT CCT GCC GGA GCC 
Glu His Ser Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala 
195 200 . 205 

CAG GTG CGC CTA GAG AAC GGG GAG CGT GTG GCC CTG TCA GCT GTA AAG 
Gin Val Arg Leu Glu Asn Gly Glu Arg Val Ala Leu Ser Ala Val Lys 
210 215 220 

CCA GGA GAC CGG GTG CTG GCC ATG GGG GAG GAT GGG ACC CCC ACC TTC 
Pro Gly Asp Arg Val Leu Ala Met Gly Glu Asp Gly Thr Pro Thr Phe 
225 230 235 240 

AGT GAT GTG CTT ATT TTC CTG GAC CGC GAG CCA AAC CGG CTG AGA GCT 
Ser Asp Val Leu lie Phe Leu Asp Arg Glu Pro Asn Arg Leu Arg Ala 
245 250 255 

TTC CAG GTC ATC GAG ACT CAG GAT CCT CCG CGT CGG CTG GCG CTC ACG 
Phe Gin Val lie Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr 
260 265 270 

CCT GCC CAC CTG CTC TTC ATT GCG GAC AAT CAT ACA GAA CCA GCA GCC 
Pro Ala His Leu Leu Phe lie Ala Asp Asn His Thr Glu Pro Ala Ala 
275 280 285 

CAC TTC CGG GCC ACA TTT GCC AGC CAT GTG CAA CCA GGC CAA TAT GTG 
His Phe Arg Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val 
290 295 300 

CTG GTA TCA GGG GTA CCA GGC CTC CAG CCT GCT. CGG GTG GCA GCT GTC 
Leu Val Ser Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val 
305 310 315 320 

TCC ACC CAC GTG GCC CTT GGG TCC TAT GCT CCT CTC ACA AGG CAT GGG 
Ser Thr His Val Ala Leu Gly Ser Tyr Ala Pro Leu Thr Arg His Gly 
325 330 335 

ACA CTT GTG GTG GAG GAT GTG GTG GCC TCC TGC TTT GCA GCT GTG GCT 
Thr Leu Val Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala 
340 345 350 

GAC CAC CAT CTG GCT CAG TTG GCC TTC TGG CCC CTG CGA CTG TTT CCC 
Asp His His Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe Pro 
355 360 365 

AGT TTG GCA TGG GGC AGC TGG ACC CCA AGT GAG GGT GTT CAC TCC TAC 
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Ser Leu Ala Trp Gly Ser Trp Thr 
370 375 

CCT CAG ATG CTC TAC CGC CTG GGG 
Pro Gin Met Leu Tyr Arg Leu Gly 
385 390 

TTC CAT CCA CTG GGC ATG 
Phe His Pro Leu Gly Met 
405 

CCCTCCTGGA ACTGCTGTGC GTGGATCC 



Pro Ser Glu Gly Val His Ser Tyr 
380 

CGT CTC TTG CTA GAA GAG AGC ACC 
Arg Leu Leu Leu Glu Glu Ser Thr 
395 400 



TCT GGG GCA GGA AGC TGAAGGGACT CTAACCACTG 
Ser Gly Ala Gly Ser 
410 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

( D ) TOPOLOGY : 1 i ne a r 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1314 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG CTG CTG CTG CTG GCC AGA TGT TTT CTG GTG ATC CTT GCT TCC TCG 
Met Leu Leu Leu Leu Ala Arg Cys Phe Leu Val lie Leu Ala Ser Ser 
15 10 15 

CTG CTG GTG TGC CCC GGG CTG GCC TGT GGG CCC GGC AGG GGG TTT GGA 
Leu Leu Val Cys Pro Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly 
20 25 30 

AAG AGG CGG CAC CCC AAA AAG CTG ACC CCT TTA GCC TAC AAG CAG TTT 
Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe 
35 40 45 

ATT CCC AAC GTA GCC GAG AAG ACC CTA GGG GCC AGC GGC AGA TAT GAA 
lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu 
50 55 60 

GGG AAG ATC ACA AGA AAC TCC GAA CGA TTT AAG GAA CTC ACC CCC AAT 
Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn 
65 70 75 80 

TAC AAC CCC GAC ATC ATA TTT AAG GAT GAG GAA AAC ACG GGA GCA GAC 
Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp 
85 90 95 

CGG CTG ATG ACT CAG AGG TGC AAA GAC AAG TTA AAT GCC TTG GCC ATC 
Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala lie 
100 105 110 
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TCT GTG ATG AAC CAG TGG CCT GGA GTG AGG CTG CGA GTG ACC GAG GGC 
Ser Val Met Asn Gin Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

TGG GAT GAG GAC GGC CAT CAT TCA GAG GAG TCT CTA CAC TAT GAG GGT 
Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly 
130 135 140 

CGA GCA GTG GAC ATC ACC ACG TCC GAC CGG GAC CGC AGC AAG TAC GGC 
Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly 
145 150 155 160 

ATG CTG GCT CGC CTG GCT GTG GAA GCA GGT TTC GAC TGG GTC TAC TAT 
Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 

GAA TCC AAA GCT CAC ATC CAC TGT TCT GTG AAA GCA GAG AAC TCC GTG 
Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val 
180 185 190 

GCG GCC AAA TCC GGC GGC TGT TTC CCG GGA TCC GCC ACC GTG CAC CTG 
Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu 
195 200 205 

GAG CAG GGC GGC ACC AAG CTG GTG AAG GAC TTA CGT CCC GGA GAC CGC 
Glu Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Arg Pro Gly Asp Arg 
210 215 220 

GTG CTG GCG GCT GAC GAC CAG GGC CGG CTG CTG TAC AGC GAC TTC CTC 
Val Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu 
225 230 235 240 

ACC TTC CTG GAC CGC GAC GAA GGC GCC AAG AAG GTC TTC TAC GTG ATC 
Thr Phe Leu Asp Arg Asp Glu Gly Ala Lys Lys Val Phe Tyr Val lie 
245 250 255 

GAG ACG CTG GAG CCG CGC GAG CGC CTG CTG CTC ACC GCC GCG CAC CTG 
Glu Thr Leu Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu 
260 265 270 

CTC TTC GTG GCG CCG CAC AAC GAC TCG GGG CCC ACG CCC GGG CCA AGC 
Leu Phe Val Ala Pro His Asn Asp Ser Gly Pro Thr Pro Gly Pro Ser 
275 280 285 

GCG CTC TTT GCC AGC CGC GTG CGC CCC GGG CAG CGC GTG TAC GTG GTG 
Ala Leu Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val 
290 295 300 

GCT GAA CGC GGC GGG GAC CGC CGG CTG CTG CCC GCC GCG GTG CAC AGC 
Ala Glu Arg Gly Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser 
305 310 315 320 

GTG ACG CTG CGA GAG GAG GAG GCG GGC GCG TAC GCG CCG CTC ACG GCG 
Val Thr Leu Arg Glu Glu Glu Ala Gly Ala Tyr Ala Pro Leu Thr Ala 
325 330 335 

CAC GGC ACC ATT CTC ATC AAC CGG GTG CTC GCC TCG TGC TAC GCT GTC 
His Gly Thr He Leu He Asn Arg Val Leu Ala Ser Cys Tyr Ala Val 
340 345 350 
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ATC GAG GAG CAC AGC TGG GCA CAC CGG GCC TTC GCG CCT TTC CGC CTG 1104 

lie Glu Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu 
355 360 365 

5 GCG CAC GCG CTG CTG GCC GCG CTG GCA CCC GCC CGC ACG GAC GGC GGG 1152 

Ala His Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Gly Gly 
370 375 380 

GGC GGG GGC AGC ATC CCT GCA GCG CAA TCT GCA ACG GAA GCG AGG GGC 12 00 

10 Gly Gly Gly Ser lie Pro Ala Ala Gin Ser Ala Thr Glu Ala Arg Gly 
385 390 395 400 

GCG GAG CCG ACT GCG GGC ATC CAC TGG TAC TCG CAG CTG CTC TAC CAC 1248 

Ala Glu Pro Thr Ala Gly lie His Trp Tyr Ser Gin Leu Leu Tyr His 

15 405 410 415 

ATT GGC ACC TGG CTG TTG GAC AGC GAG ACC ATG CAT CCC TTG GGA ATG 12 96 

lie Gly Thr Trp Leu Leu Asp Ser Glu Thr Met His Pro Leu Gly Met 
420 425 430 



20 



H25 



£35 



U 40 



55 



GCG GTC AAG TCC AGC TG 1313 
Ala Val Lys Ser Ser 
435 

(2) INFORMATION FOR SEQ ID NO : 5 : 



,h (i) SEQUENCE CHARACTERISTICS: 

~ (A) LENGTH: 1256 base pairs 

^30 <B) TYPE: nucleic acid 

4* (C) STRANDEDNESS : both 

s (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..1257 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



ATG CGG CTT TTG ACG AGA GTG CTG CTG GTG TCT CTT CTC ACT CTG TCC 48 
45 Met Arg Leu Leu Thr Arg Val Leu Leu Val Ser Leu Leu Thr Leu Ser 
15 10 15 

TTG GTG GTG TCC GGA CTG GCC TGC GGT CCT GGC AGA GGC TAC GGC AGA 96 
Leu Val Val Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Tyr Gly Arg 
50 20 25 30 

AGA AGA CAT CCG AAG AAG CTG ACA CCT CTC GCC TAC AAG CAG TTC ATA 144 
Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
35 40 45 



CCT AAT GTC GCG GAG AAG ACC TTA GGG GCC AGC GGC AGA TAC GAG GGC 192 
Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 



60 AAG ATA ACG CGC AAT TCG GAG AGA TTT AAA GAA CTT ACT CCA AAT TAC 24 0 
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Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

AAT CCC GAC ATT ATC TTT AAG GAT GAG GAG AAC ACG GGA GCG GAC AGG 
Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 
85 90 95 

CTC ATG ACA CAG AGA TGC AAA GAC AAG CTG AAC TCG CTG GCC ATC TCT 
Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ser Leu Ala lie Ser 
100 105 110 

GTA ATG AAC CAC TGG CCA GGG GTT AAG CTG CGT GTG ACA GAG GGC TGG 
Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

GAT GAG GAC GGT CAC CAT TTT GAA GAA TCA CTC CAC TAC GAG GGA AGA 
Asp Glu Asp Gly His His Phe Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

GCT GTT GAT ATT ACC ACC TCT GAC CGA GAC AAG AGC AAA TAC GGG ACA 
Ala Val Asp lie Thr Thr Ser Asp Arg Asp Lys Ser Lys Tyr Gly Thr 
145 150 155 160 

CTG TCT CGC CTA GCT GTG GAG GCT GGA TTT GAC TGG GTC TAT TAC GAG 
Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 
165 170 175 

TCC AAA GCC CAC ATT CAT TGC TCT GTC AAA GCA GAA AAT TCG GTT GCT 
Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

GCG AAA TCT GGG GGC TGT TTC CCA GGT TCG GCT CTG GTC TCG CTC CAG 
Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu Val Ser Leu Gin 
195 200 205 

GAC GGA GGA CAG AAG GCC GTG AAG GAC CTG AAC CCC GGA GAC AAG GTG 
Asp Gly Gly Gin Lys Ala Val Lys Asp Leu Asn Pro Gly Asp Lys Val 
210 215 220 

CTG GCG GCA GAC AGC GCG GGA AAC CTG GTG TTC AGC GAC TTC ATC ATG 
Leu Ala Ala Asp Ser Ala Gly Asn Leu Val Phe Ser Asp Phe lie Met 
225 230 235 240 

TTC ACA GAC CGA GAC TCC ACG ACG CGA CGT GTG TTT TAC GTC ATA GAA 
Phe Thr Asp Arg Asp Ser Thr Thr Arg Arg Val Phe Tyr Val lie Glu 
245 250 255 

ACG CAA GAA CCC GTT GAA AAG ATC ACC CTC ACC GCC GCT CAC CTC CTT 
Thr Gin Glu Pro Val Glu Lys lie Thr Leu Thr Ala Ala His Leu Leu 
260 265 270 

TTT GTC CTC GAC AAC TCA ACG GAA GAT CTC CAC ACC ATG ACC GCC GCG 
Phe Val Leu Asp Asn Ser Thr Glu Asp Leu His Thr Met Thr Ala Ala 
275 280 285 

TAT GCC AGC AGT GTC AGA GCC GGA CAA AAG GTG ATG GTT GTT GAT GAT 
Tyr Ala Ser Ser Val Arg Ala Gly Gin Lys Val Met Val Val Asp Asp 
290 295 300 

AGC GGT CAG CTT AAA TCT GTC ATC GTG CAG CGG ATA TAC ACG GAG GAG 
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Ser Gly Gin Leu Lys Ser Val lie Val Gin Arg lie Tyr Thr Glu Glu 
305 310 315 320 

CAG CGG GGC TCG TTC GCA CCA GTG ACT GCA CAT GGG ACC ATT GTG GTC 
Gin Arg Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He Val Val 
325 330 335 

GAC AGA ATA CTG GCG TCC TGT TAC GCC GTA ATA GAG GAC CAG GGG CTT 
Asp Arg He Leu Ala Ser Cys Tyr Ala Val He Glu Asp Gin Gly Leu 
340 345 350 

GCG CAT TTG GCC TTC GCG CCC GCC AGG CTC TAT TAT TAC GTG TCA TCA 
Ala His Leu Ala Phe Ala Pro Ala Arg Leu Tyr Tyr Tyr Val Ser Ser 
355 360 365 

TTC CTG TCC CCC AAA ACT CCA GCA GTC GGT CCA ATG CGA CTT TAC AAC 
Phe Leu Ser Pro Lys Thr Pro Ala Val Gly Pro Met Arg Leu Tyr Asn 
370 375 380 

AGG AGG GGG TCC ACT GGT ACT CCA GGC TCC TGT CAT CAA ATG GGA ACG 
Arg Arg Gly Ser Thr Gly Thr Pro Gly Ser Cys His Gin Met Gly Thr 
385 390 395 400 

TGG CTT TTG GAC AGC AAC ATG CTT CAT CCT TTG GGG ATG TCA GTA AAC 
Trp Leu Leu Asp Ser Asn Met Leu His Pro Leu Gly Met Ser Val Asn 
405 410 415 

TCA AGC TG 
Ser Ser 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1425 



(xi) SEQUENi 

ATG CTG CTG CTG 
Met Leu Leu Leu 
1 

CTG GTA TGC TCG 
Leu Val Cys Ser 
20 

AGG AGG CAC CCC 
Arg Arg His Pro 



IE DESCRIPTION: i 

GCG AGA TGT CTG 
Ala Arg Cys Leu 
5 

GGA CTG GCG TGC 
Gly Leu Ala Cys 



AAA AAG CTG ACC 
Lys Lys Leu Thr 



!EQ ID NO: 6: 

CTG CTA GTC CTC 

Leu Leu Val Leu 
10 

GGA CCG GGC AGG 

Gly Pro Gly Arg 
25 

CCT TTA GCC TAC 

Pro Leu Ala Tyr 



GTC TCC TCG CTG 
Val Ser Ser Leu 
15 

GGG TTC GGG AAG 
Gly Phe Gly Lys 
30 

AAG CAG TTT ATC 
Lys Gin Phe He 
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35 40 45 

CCC AAT GTG GCC GAG AAG ACC CTA GGC GCC AGC GGA AGG TAT GAA GGG 
Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

AAG ATC TCC AGA AAC TCC GAG CGA TTT AAG GAA CTC ACC CCC AAT TAC 
Lys lie Ser Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

AAC CCC GAC ATC ATA TTT AAG GAT GAA GAA AAC ACC GGA GCG GAC AGG 
Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 
85 90 95 

CTG ATG ACT CAG AGG TGT AAG GAC AAG TTG AAC GCT TTG GCC ATC TCG 
Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala lie Ser 
100 105 110 

GTG ATG AAC CAG TGG CCA GGA GTG AAA CTG CGG GTG ACC GAG GGC TGG 
Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

GAC GAA GAT GGC CAC CAC TCA GAG GAG TCT CTG CAC TAC GAG GGC CGC 
Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

GCA GTG GAC ATC ACC ACG TCT GAC CGC GAC CGC AGC AAG TAC GGC ATG 
Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly Met 
145 150 155 160 

CTG GCC CGC CTG GCG GTG GAG GCC GGC TTC GAC TGG GTG TAC TAC GAG 
Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 
165 170 175 

TCC AAG GCA CAT ATC CAC TGC TCG GTG AAA GCA GAG AAC TCG GTG GCG 
Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

GCC AAA TCG GGA GGC TGC TTC CCG GGC TCG GCC ACG GTG CAC CTG GAG 
Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu Glu 
195 200 205 

CAG GGC GGC ACC AAG CTG GTG AAG GAC CTG AGC CCC GGG GAC CGC GTG 
Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly Asp Arg Val 
210 215 220 

CTG GCG GCG GAC GAC CAG GGC CGG CTG CTC TAC AGC GAC TTC CTC ACT 
Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu Thr 
225 230 235 240 

TTC CTG GAC CGC GAC GAC GGC GCC AAG AAG GTC TTC TAC GTG ATC GAG 
Phe Leu Asp Arg Asp Asp Gly Ala Lys Lys Val Phe Tyr Val lie Glu 
245 250 255 

ACG CGG GAG CCG CGC GAG CGC CTG CTG CTC ACC GCC GCG CAC CTG CTC 
Thr Arg Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu Leu 
260 265 270 

TTT GTG GCG CCG CAC AAC GAC TCG GCC ACC GGG GAG CCC GAG GCG TCC 
Phe Val Ala Pro His Asn Asp Ser Ala Thr Gly Glu Pro Glu Ala Ser 
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275 280 285 

TCG GGC TCG GGG CCG CCT TCC GGG GGC GCA CTG GGG CCT CGG GCG CTG 
Ser Gly Ser Gly Pro Pro Ser Gly Gly Ala Leu Gly Pro Arg Ala Leu 
290 295 300 

TTC GCC AGC CGC GTG CGC CCG GGC CAG CGC GTG TAC GTG GTG GCC GAG 
Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val Ala Glu 
305 310 315 320 

CGT GAC GGG GAC CGC CGG CTC CTG CCC GCC GCT GTG CAC AGC GTG ACC 
Arg Asp Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser Val Thr 
325 330 335 

CTA AGC GAG GAG GCC GCG GGC GCC TAC GCG CCG CTC ACG GCC CAG GGC 
Leu Ser Glu Glu Ala Ala Gly Ala Tyr Ala Pro Leu Thr Ala Gin Gly 
340 345 350 

ACC ATT CTC ATC AAC CGG GTG CTG GCC TCG TGC TAC GCG GTC ATC GAG 
Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val lie Glu 
355 360 365 

GAG CAC AGC TGG GCG CAC CGG GCC TTC GCG CCC TTC CGC CTG GCG CAC 
Glu His Ser Tip Ala His Arg Ala Phe Ala Pro Phe Arg Leu Ala His 
370 375 380 

GCG CTC CTG GCT GCA CTG GCG CCC GCG CGC ACG GAC CGC GGC GGG GAC 
Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Arg Gly Gly Asp 
385 390 395 400 

AGC GGC GGC GGG GAC CGC GGG GGC GGC GGC GGC AGA GTA GCC CTA ACC 
Ser Gly Gly Gly Asp Arg Gly Gly Gly Gly Gly Arg Val Ala Leu Thr 
405 410 415 

GCT CCA GGT GCT GCC GAC GCT CCG GGT GCG GGG GCC ACC GCG GGC ATC 
Ala Pro Gly Ala Ala Asp Ala Pro Gly Ala Gly Ala Thr Ala Gly He 
420 425 430 

CAC TGG TAC TCG CAG CTG CTC TAC CAA ATA GGC ACC TGG CTC CTG GAC 
His Trp Tyr Ser Gin Leu Leu Tyr Gin He Gly Thr Trp Leu Leu Asp 
435 440 445 

AGC GAG GCC CTG CAC CCG CTG GGC ATG GCG GTC AAG TCC AGC NNN AGC 
Ser Glu Ala Leu His Pro Leu Gly Met Ala Val Lys Ser Ser Xaa Ser 
450 455 460 

CGG GGG GCC GGG GGA GGG GCG CGG GAG GGG GCC 
Arg Gly Ala Gly Gly Gly Ala Arg Glu Gly Ala 
465 470 475 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 51.. 1283 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CATCAGCCCA CCAGGAGACC TCGCCCGCCG CTCCCCCGGG CTCCCCGGCC ATG TCT 

Met Ser 
1 

CCC GCC CGG CTC CGG CCC CGA CTG CAC TTC TGC CTG GTC CTG TTG CTG 
Pro Ala Arg Leu Arg Pro Arg Leu His Phe Cys Leu Val Leu Leu Leu 
5 10 15 

CTG CTG GTG GTG CCC GCG GCA TGG GGC TGC GGG CCG GGT CGG GTG GTG 
Leu Leu Val Val Pro Ala Ala Trp Gly Cys Gly Pro Gly Arg Val Val 
20 25 30 

GGC AGC CGC CGG CGA CCG CCA CGC AAA CTC GTG CCG CTC GCC TAC AAG 
Gly Ser Arg Arg Arg Pro Pro Arg Lys Leu Val Pro Leu Ala Tyr Lys 
35 40 45 50 

CAG TTC AGC CCC AAT GTG CCC GAG AAG ACC CTG GGC GCC AGC GGA CGC 
Gin Phe Ser Pro Asn Val Pro Glu Lys Thr Leu Gly Ala Ser Gly Arg 
55 60 65 

TAT GAA GGC AAG ATC GCT CGC AGC TCC GAG CGC TTC AAG GAG CTC ACC 
Tyr Glu Gly Lys lie Ala Arg Ser Ser Glu Arg Phe Lys Glu Leu Thr 
70 75 80 

CCC AAT TAC AAT CCA GAC ATC ATC TTC AAG GAC GAG GAG AAC ACA GGC 
Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly 
85 90 95 

GCC GAC CGC CTC ATG ACC CAG CGC TGC AAG GAC CGC CTG AAC TCG CTG 
Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn Ser Leu 
100 105 110 

GCT ATC TCG GTG ATG AAC CAG TGG CCC GGT GTG AAG CTG CGG GTG ACC 
Ala lie Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 130 

GAG GGC TGG GAC GAG GAC GGC CAC CAC TCA GAG GAG TCC CTG CAT TAT 
Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr 
135 140 145 

GAG GGC CGC GCG GTG GAC ATC ACC ACA TCA GAC CGC GAC CGC AAT AAG 
Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys 
150 155 160 

TAT GGA CTG CTG GCG CGC TTG GCA GTG GAG GCC GGC TTT GAC TGG GTG 
Tyr Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
165 170 175 

TAT TAC GAG TCA AAG GCC CAC GTG CAT TGC TCC GTC AAG TCC GAG CAC 
Tyr Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser Glu His 
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180 185 190 

TCG GCC GCA GCC AAG ACG GGC GGC TGC TTC CCT GCC GGA GCC CAG GTA 
Ser Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala Gin Val 
195 200 205 210 

CGC CTG GAG AGT GGG GCG CGT GTG GCC TTG TCA GCC GTG AGG CCG GGA 
Arg Leu Glu Ser Gly Ala Arg Val Ala Leu Ser Ala Val Arg Pro Gly 
215 220 225 

GAC CGT GTG CTG GCC ATG GGG GAG GAT GGG AGC CCC ACC TTC AGC GAT 
Asp Arg Val Leu Ala Met Gly Glu Asp Gly Ser Pro Thr Phe Ser Asp 
230 235 240 

GTG CTC ATT TTC CTG GAC CGC GAG CCC CAC AGG CTG AGA GCC TTC CAG 
Val Leu lie Phe Leu Asp Arg Glu Pro His Arg Leu Arg Ala Phe Gin 
245 250 255 

GTC ATC GAG ACT CAG GAC CCC CCA CGC CGC CTG GCA CTC ACA CCC GCT 
Val lie Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr Pro Ala 
260 265 270 

CAC CTG CTC TTT ACG GCT GAC AAT CAC ACG GAG CCG GCA GCC CGC TTC 
His Leu Leu Phe Thr Ala Asp Asn His Thr Glu Pro Ala Ala Arg Phe 
275 280 285 290 

CGG GCC ACA TTT GCC AGC CAC GTG CAG CCT GGC CAG TAC GTG CTG GTG 
Arg Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val Leu Val 
295 300 305 

GCT GGG GTG CCA GGC CTG CAG CCT GCC CGC GTG GCA GCT GTC TCT ACA 
Ala Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val Ser Thr 
310 315 320 

CAC GTG GCC CTC GGG GCC TAC GCC CCG CTC ACA AAG CAT GGG ACA CTG 
His Val Ala Leu Gly Ala Tyr Ala Pro Leu Thr Lys His Gly Thr Leu 
325 330 335 

GTG GTG GAG GAT GTG GTG GCA TCC TGC TTC GCG GCC GTG GCT GAC CAC 
Val Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala Asp His 
340 345 350 

CAC CTG GCT CAG TTG GCC TTC TGG CCC CTG AGA CTC TTT CAC AGC TTG 
His Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe His Ser Leu 
355 360 365 370 

GCA TGG GGC AGC TGG ACC CCG GGG GAG GGT GTG CAT TGG TAC CCC CAG 
Ala Trp Gly Ser Trp Thr Pro Gly Glu Gly Val His Trp Tyr Pro Gin 
375 380 385 

CTG CTC TAC CGC CTG GGG CGT CTC CTG CTA GAA GAG GGC AGC TTC CAC 
Leu Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Gly Ser Phe His 
390 395 400 

CCA CTG GGC ATG TCC GGG GCA GGG AGC TGAAAGGACT CCACCGCTGC 
Pro Leu Gly Met Ser Gly Ala Gly Ser 
405 410 

CCTCCTGGAA CTGCTGTACT GGGTCCAGAA GCCTCTCAGC CAGGAGGGAG CTGGCCCTGG 
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AAGGGACCTG AGCTGGGGGA CACTGGCTCC TGCCATCTCC TCTGCCATGA AGATACACCA 142 3 

TTGAGACTTG ACTGGGCAAC ACCAGCGTCC CCCACCCGCG TCGTGGTGTA GTCATAGAGC 1483 

TGCAAGCTGA GCTGGCGAGG GGATGGTTGT TGACCCCTCT CTCCTAGAGA CCTTGAGGCT 1543 

GGCACGGCGA CTCCCAACTC AGCCTGCTCT CACTACGAGT TTTCATACTC TGCCTCCCCC 1603 

ATTGGGAGGG CCCATTCCC 1622 
(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
%y (B) LOCATION: 1..1191 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



ATG GCT CTC CTG ACC AAT CTA CTG CCC TTG TGC TGC TTG GCA CTT CTG 4 8 



0^30 Met Ala Leu Leu Thr Asn Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
4= 1 5 10 15 



GCG CTG CCA GCC CAG AGC TGC GGG CCG GGC CGG GGG CCG GTT GGC CGG 96 
|^ Ala Leu Pro Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 

L-35 20 25 30 



£H CGC CGC TAT GCG CGC AAG CAG CTC GTG CCG CTA CTC TAC AAG CAA TTT 144 

Q Arg Arg Tyr Ala Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 

35 40 45 



GTG CCC GGC GTG CCA GAG CGG ACC CTG GGC GCC AGT GGG CCA GCG GAG 192 

Val Pro Gly Val Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

45 GGG AGG GTG GCA AGG GGC TCC GAG CGC TTC CGG GAC CTC GTG CCC AAC 24 0 

Gly Arg Val Ala Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

TAC AAC CCC GAC ATC ATC TTC AAG GAT GAG GAG AAC AGT GGA GCC GAC 2 88 

50 Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 

85 90 95 

CGC CTG ATG ACC GAG CGT TGC AAG GAG AGG GTG AAC GCT TTG GCC ATT 336 

Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala lie 

55 100 105 110 

GCC GTG ATG AAC ATG TGG CCC GGA GTG CGC CTA CGA GTG ACT GAG GGC 3 84 

Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

60 
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TGG 


GAC 


GAG 


GAC 


GGC 


CAC 


CAC 


GCT 


CAG 


GAT 


TCA 


CTC 


CAC 


TAC 


GAA 


GGC 


432 






Trp 


Asp 


Glu 


Asp 


Gly 


His 


His 


Ala 


Gin 


Asp 


Ser 


Leu 


His 


Tyr 


Glu 


Gly 










130 










135 










140 














5 


CGT 


GCT 


TTG 


GAC 


ATC 


ACT 


ACG 


TCT 


GAC 


CGC 


GAC 


CGC 


AAC 


AAG 


TAT 


GGG 


480 






Arg 


Ala 


Leu 


Asp 


He 


Thr 


Thr 


Ser 


Asp 


Arg 


Asp 


Arg 


Asn 


Lys 


Tyr 


Gly 








145 










150 










155 










160 








TTG 


CTG 


GCG 


CGC 


CTC 


GCA 


GTG 


GAA 


GCC 


GGC 


TTC 


GAC 


TGG 


GTC 


TAC 


TAC 


528 




10 


Leu 


Leu 


Ala 


Arg 


Leu 


Ala 


Val 


Glu 


Ala 


Gly 


Phe 


Asp 


Trp 


Val 


Tyr 


Tyr 
















165 










170 










175 










GAG 


TCC 


CGC 


AAC 


CAC 


GTC 


CAC 


GTG 


TCG 


GTC 


AAA 


GCT 


GAT 


AAC 


TCA 


CTG 


576 






Glu 


Ser 


Arg 


Asn 


His 


Val 


His 


Val 


Ser 


Val 


Lys 


Ala 


Asp 


Asn 


Ser 


Leu 






15 








180 










185 










190 












GCG 


GTC 


CGG 


GCG 


GGC 


GGC 


TGC 


TTT 


CCG 


GGA 


AAT 


GCA 


ACT 


GTG 


CGC 


CTG 


624 






Ala 


Val 


Arg 


Ala 


Gly 


Gly 


Cys 


Phe 


Pro 


Gly 


Asn 


Ala 


Thr 


Val 


Arg 


Leu 












195 










200 










205 












20 








































TGG 


AGC 


GGC 


GAG 


CGG 


AAA 


GGG 


CTG 


CGG 


GAA 


CTG 


CAC 


CGC 


GGA 


GAC 


TGG 


672 






Trp 


Ser 


Gly 


Glu 


Arg 


Lys 


Gly 


Leu 


Arg 


Glu 


Leu 


His 


Arg 


Gly 


Asp 


Trp 










210 










215 










220 














25 


GTT 


TTG 


GCG 


GCC 


GAT 


GCG 


TCA 


GGC 


CGG 


GTG 


GTG 


CCC 


ACG 


CCG 


GTG 


CTG 


720 


rl 1 
i ^ 




Val 


Leu 


Ala 


Ala 


Asp 


Ala 


Ser 


Gly 


Arg 


Val 


Val 


Pro 


Thr 


Pro 


Val 


Leu 




£ 




225 










230 










235 










240 




m 


30 


CTC 


TTC 


CTG 


GAC 


CGG 


GAC 


TTG 


CAG 


CGC 


CGG 


GCT 


TCA 


TTT 


GTG 


GCT 


GTG 


768 




Leu 


Phe 


Leu 


Asp 


Arg 


Asp 


Leu 


Gin 


Arg 


Arg 


Ala 


Ser 


Phe 


Val 


Ala 


Val 




P 












245 










250 










255 










GAG 


ACC 


GAG 


TGG 


CCT 


CCA 


CGC 


AAA 


CTG 


TTG 


CTC 


ACG 


CCC 


TGG 


CAC 


CTG 


816 


y. 


35 


Glu 


Thr 


Glu 


Trp 


Pro 


Pro 


Arg 


Lys 


Leu 


Leu 


Leu 


Thr 


Pro 


Trp 


His 


Leu 




! y 








260 










265 










270 








03 




GTG 


TTT 


GCC 


GCT 


CGA 


GGG 


CCG 


GCG 


CCC 


GCG 


CCA 


GGC 


GAC 


TTT 


GCA 


CCG 


864 


0 




Val 


Phe 


Ala 


Ala 


Arg 


Gly 


Pro 


Ala 


Pro 


Ala 


Pro 


Gly Asp 


Phe 


Ala 


Pro 












275 










280 










285 










t c 


40 








































GTG 


TTC 


GCG 


CGC 


CGG 


CTA 


CGC 


GCT 


GGG 


GAC 


TCG 


GTG 


CTG 


GCG 


CCC 


GGC 


912 






Val 


Phe 


Ala 


Arg 


Arg 


Leu 


Arg 


Ala 


Gly 


Asp 


Ser 


Val 


Leu 


Ala 


Pro 


Gly 










290 










295 










300 














45 


GGG 


GAT 


GCG 


CTT 


CGG 


CCA 


GCG 


CGC 


GTG 


GCC 


CGT 


GTG 


GCG 


CGG 


GAG 


GAA 


960 






Gly 


Asp 


Ala 


Leu 


Arg 


Pro 


Ala 


Arg 


Val 


Ala 


Arg 


Val 


Ala 


Arg 


Glu 


Glu 








305 










310 










315 










320 






PA 

50 


GCC 


GTG 


GGC 


GTG 


TTC 


GCG 


CCG 


CTC 


ACC 


GCG 


CAC 


GGG 


ACG 


CTG 


CTG 


GTG 


1008 




Ala 


Val 


Gly 


Val 


Phe 


Ala 


Pro 


Leu 


Thr 


Ala 


His 


Gly Thr 


Leu 


Leu 


Val 
















325 










330 










335 










AAC 


GAT 


GTC 


CTG 


GCC 


TCT 


TGC 


TAC 


GCG 


GTT 


CTG 


GAG 


AGT 


CAC 


CAG 


TGG 


1056 




55 


Asn 


Asp 


Val 


Leu 


Ala 


Ser 


Cys 


Tyr 


Ala 


Val 


Leu 


Glu 


Ser 


His 


Gin 


Trp 












340 










345 










350 












GCG 


CAC 


CGC 


GCT 


TTT 


GCC 


CCC 


TTG 


AGA 


CTG 


CTG 


CAC 


GCG 


CTA 


GGG 


GCG 


1104 






Ala 


His 


Arg 


Ala 


Phe 


Ala 


Pro 


Leu 


Arg 


Leu 


Leu 


His 


Ala 


Leu 


Gly 


Ala 





355 360 365 

60 
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CTG CTC CCC GGC GGG GCC GTC CAG CCG ACT GGC ATG CAT TGG TAC TCT 
Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 
370 375 380 

CGG CTC CTC TAC CGC TTA GCG GAG GAG CTA CTG GGC TG 
Arg Leu Leu Tyr Arg Leu Ala Glu Glu Leu Leu Gly 
385 390 395 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1248 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATG GAC GTA AGG CTG CAT CTG AAG CAA TTT GCT TTA CTG TGT TTT ATC 
Met Asp Val Arg Leu His Leu Lys Gin Phe Ala Leu Leu Cys Phe lie 
15 10 15 

AGC TTG CTT CTG ACG CCT TGT GGA TTA GCC TGT GGT CCT GGT AGA GGT 
Ser Leu Leu Leu Thr Pro Cys Gly Leu Ala Cys Gly Pro Gly Arg Gly 
20 25 30 

TAT GGA AAA CGA AGA CAC CCA AAG AAA TTA ACC CCG TTG GCT TAC AAG 
Tyr Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

CAA TTC ATC CCC AAC GTT GCT GAG AAA ACG CTT GGA GCC AGC GGC AAA 
Gin Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Lys 
50 55 60 

TAC GAA GGC AAA ATC ACA AGG AAT TCA GAG AGA TTT AAA GAG CTG ATT 
Tyr Glu Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu lie 
65 70 75 80 

CCG AAT TAT AAT CCC GAT ATC ATC TTT AAG GAC GAG GAA AAC ACA AAC 
Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Asn 
85 90 95 

GCT GAC AGG CTG ATG ACC AAG CGC TGT AAG GAC AAG TTA AAT TCG TTG 
Ala Asp Arg Leu Met Thr Lys Arg Cys Lys Asp Lys Leu Asn Ser Leu 
100 105 110 



GCC ATA TCC GTC ATG AAC CAC TGG CCC GGC GTG AAA CTG CGC GTC ACT 
Ala lie Ser Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 
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GAA GGC TGG GAT GAG GAT GGT CAC CAT TTA GAA GAA TCT TTG CAC TAT 
Glu Gly Trp Asp Glu Asp Gly His His Leu Glu Glu Ser Leu His Tyr 
130 135 140 

GAG GGA CGG GCA GTG GAC ATC ACT ACC TCA GAC AGG GAT AAA AGC AAG 
Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Lys Ser Lys 
145 150 155 160 

TAT GGG ATG CTA TCC AGG CTT GCA GTG GAG GCA GGA TTC GAC TGG GTC 
Tyr Gly Met Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
165 170 175 

TAT TAT GAA TCT AAA GCC CAC ATA CAC TGC TCT GTC AAA GCA GAA AAT 
Tyr Tyr Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn 
180 185 190 

TCA GTG GCT GCT AAA TCA GGA GGA TGT TTT CCT GGG TCT GGG ACG GTG 
Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Gly Thr Val 
195 200 205 

ACA CTT GGT GAT GGG ACG AGG AAA CCC ATC AAA GAT CTT AAA GTG GGC 
Thr Leu Gly Asp Gly Thr Arg Lys Pro lie Lys Asp Leu Lys Val Gly 
210 215 220 

GAC CGG GTT TTG GCT GCA GAC GAG AAG GGA AAT GTC TTA ATA AGC GAC 
Asp Arg Val Leu Ala Ala Asp Glu Lys Gly Asn Val Leu lie Ser Asp 
225 230 235 240 

TTT ATT ATG TTT ATA GAC CAC GAT CCG ACA ACG AGA AGG CAA TTC ATC 
Phe lie Met Phe lie Asp His Asp Pro Thr Thr Arg Arg Gin Phe lie 
245 250 255 

GTC ATC GAG ACG TCA GAA CCT TTC ACC AAG CTC ACC CTC ACT GCC GCG 
Val lie Glu Thr Ser Glu Pro Phe Thr Lys Leu Thr Leu Thr Ala Ala 
260 265 270 

CAC CTA GTT TTC GTT GGA AAC TCT TCA GCA GCT TCG GGT ATA ACA GCA 
His Leu Val Phe Val Gly Asn Ser Ser Ala Ala Ser Gly lie Thr Ala 
275 280 285 

ACA TTT GCC AGC AAC GTG AAG CCT GGA GAT ACA GTT TTA GTG TGG GAA 
Thr Phe Ala Ser Asn Val Lys Pro Gly Asp Thr Val Leu Val Trp Glu 
290 295 300 

GAC ACA TGC GAG AGC CTC AAG AGC GTT ACA GTG AAA AGG ATT TAC ACT 
Asp Thr Cys Glu Ser Leu Lys Ser Val Thr Val Lys Arg lie Tyr Thr 
305 310 315 320 

GAG GAG CAC GAG GGC TCT TTT GCG CCA GTC ACC GCG CAC GGA ACC ATA 
Glu Glu His Glu Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He 
325 330 335 

ATA GTG GAT CAG GTG TTG GCA TCG TGC TAC GCG GTC ATT GAG AAC CAC 
He Val Asp Gin Val Leu Ala Ser Cys Tyr Ala Val He Glu Asn His 
340 345 350 

AAA TGG GCA CAT TGG GCT TTT GCG CCG GTC AGG TTG TGT CAC AAG CTG 
Lys Trp Ala His Trp Ala Phe Ala Pro Val Arg Leu Cys His Lys Leu 
355 360 365 
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ATG ACG TGG CTT TTT CCG GCT CGT GAA TCA AAC GTC AAT TTT CAG GAG 

Met Thr Trp Leu Phe Pro Ala Arg Glu Ser Asn Val Asn Phe Gin Glu 
370 375 380 

GAT GGT ATC CAC TGG TAC TCA AAT ATG CTG TTT CAC ATC GGC TCT TGG 

Asp Gly lie His Trp Tyr Ser Asn Met Leu Phe His lie Gly Ser Trp 
385 390 395 400 

CTG CTG GAC AGA GAC TCT TTC CAT CCA CTC GGG ATT TTA CAC TTA AGT 

Leu Leu Asp Arg Asp Ser Phe His Pro Leu Gly lie Leu His Leu Ser 

405 410 415 

TGA 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Glu Met Leu Leu Leu Thr Arg lie Leu Leu Val Gly Phe lie 
. 1 5 10 15 

Cys Ala Leu Leu Val Ser Ser Gly Leu Thr Cys Gly Pro Gly Arg Gly 
20 25 30 

lie Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

Gin Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg 
50 55 60 

Tyr Glu Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr 
65 70 75 ' 80 

Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly 
85 90 95 

Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu 
100 105 110 

Ala lie Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 

Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr 
130 135 140 

Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys 
145 150 155 160 

Tyr Gly Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
165 170 175 
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Tyr Tyr Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn 
180 185 190 

Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val 
195 200 205 

His Leu Glu His Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly 
210 215 220 

Asp Arg Val Leu Ala Ala Asp Ala Asp Gly Arg Leu Leu Tyr Ser Asp 
225 230 235 240 

Phe Leu Thr Phe Leu Asp Arg Met Asp Ser Ser Arg Lys Leu Phe Tyr 
245 250 255 

Val lie Glu Thr Arg Gin Pro Arg Ala Arg Leu Leu Leu Thr Ala Ala 
260 265 270 

His Leu Leu Phe Val Ala Pro Gin His Asn Gin Ser Glu Ala Thr Gly 
275 280 285 

Ser Thr Ser Gly Gin Ala Leu Phe Ala Ser Asn Val Lys Pro Gly Gin 
290 295 300 

Arg Val Tyr Val Leu Gly Glu Gly Gly Gin Gin Leu Leu Pro Ala Ser 
305 310 315 320 

Val His Ser Val Ser Leu Arg Glu Glu Ala Ser Gly Ala Tyr Ala Pro 
325 330 335 

Leu Thr Ala Gin Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys 
340 345 350 

Tyr Ala Val lie Glu Glu His Ser Trp Ala His Trp Ala Phe Ala Pro 
355 360 365 

Phe Arg Leu Ala Gin Gly Leu Leu Ala Ala Leu Cys Pro Asp Gly Ala 
370 375 380 

lie Pro Thr Ala Ala Thr Thr Thr Thr Gly lie His Trp Tyr Ser Arg 
385 390 395 400 

Leu Leu Tyr Arg lie Gly Ser Trp Val Leu Asp Gly Asp Ala Leu His 
405 410 415 

Pro Leu Gly Met Val Ala Pro Ala Ser 
420 425 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Leu Pro Ala Ser Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
15 10 15 

Ala Leu Ser Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 25 30 

Arg Arg Tyr Val Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

Val Pro Ser Met Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

Gly Arg Val Thr Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 
85 90 95 




Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala lie 

100 105 110 

Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 

115 120 125 

Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 

130 135 140 



Arg Ala Leu Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 150 155 160 

Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 

Glu Ser Arg Asn His lie His Val Ser Val Lys Ala Asp Asn Ser Leu 
180 185 190 

Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr Val Arg Leu 
195 200 205 

Arg Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Arg Gly Asp Trp 
210 215 220 

Val Leu Ala Ala Asp Ala Ala Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Ala Val 
245 250 255 

Glu Thr Glu Arg Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
260 265 270 

Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 300 



Gly Asp Ala Leu Gin Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 
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305 310 315 320 

Ala Val Gly Val Phe Ala Pro Leu Thr Ala His Gly Thr Leu Leu Val 
325 330 335 

5 

Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
340 345 350 

Ala His Arg Ala Phe Ala Pro Leu Arg Leu Leu His Ala Leu Gly Ala 
10 355 360 365 

Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 
370 375 380 

15 Arg Leu Leu Tyr Arg Leu Ala Glu Glu Leu Met Gly 
385 390 395 



(2) INFORMATION FOR SEQ ID NO : 12 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
^ (D) TOPOLOGY: linear 

y 25 

nj (ii) MOLECULE TYPE: protein 

. h 

HI 30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

sz 

s Met Ser Pro Ala Trp Leu Arg Pro Arg Leu Arg Phe Cys Leu Phe Leu 

U . 1 5 10 15 

35 Leu Leu Leu Leu Leu Val Pro Ala Ala Arg Gly Cys Gly Pro Gly Arg 

L™ . 20 25 30 

□ Val Val Gly Ser Arg Arg Arg Pro Pro Arg Lys Leu Val Pro Leu Ala 

n 35 40 45 

^40 

Tyr Lys Gin Phe Ser Pro Asn Val Pro Glu Lys Thr Leu Gly Ala Ser 
50 55 60 

Gly Arg Tyr Glu Gly Lys lie Ala Arg Ser Ser Glu Arg Phe Lys Glu 
45 65 70 75 80 

Leu Thr Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn 
85 90 95 

50 Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn 
100 105 110 

Ser Leu Ala lie Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg 
115 120 125 

Val Thr Glu Gly Arg Asp Glu Asp Gly His His Ser Glu Glu Ser Leu 
130 135 140 



His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg 
60 145 150 155 160 
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Asn Lys Tyr Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp 
165 170 175 

Trp Val Tyr Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser 
180 185 190 

Glu His Ser Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala 
195 200 205 

Gin Val Arg Leu Glu Asn Gly Glu Arg Val Ala Leu Ser Ala Val Lys 
210 215 220 

Pro Gly Asp Arg Val Leu Ala Met Gly Glu Asp Gly Thr Pro Thr Phe 
225 230 235 240 

Ser Asp Val Leu lie Phe Leu Asp Arg Glu Pro Asn Arg Leu Arg Ala 
245 250 255 

Phe Gin Val lie Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr 
260 265 270 

Pro Ala His Leu Leu Phe lie Ala Asp Asn His Thr Glu Pro Ala Ala 
275 280 285 

His Phe Arg Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val 
290 295 300 

Leu Val Ser Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val 
305 310 315 320 

Ser Thr His Val Ala Leu Gly Ser Tyr Ala Pro Leu Thr Arg His Gly 
325 330 335 

Thr Leu Val Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala 
340 345 350 

Asp His His Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe Pro 
355 360 365 

Ser Leu Ala Trp Gly Ser Trp Thr Pro Ser Glu Gly Val His Ser Tyr 
370 375 380 

Pro Gin Met Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Ser Thr 
385 390 395 400 

Phe His Pro Leu Gly Met Ser Gly Ala Gly Ser 
405 410 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Met Leu Leu Leu Leu Ala Arg Cys Phe Leu Val lie Leu Ala Ser Ser 
15 10 15 

Leu Leu Val Cys Pro Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly 
20 25 30 

Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe 
35 40 45 

lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu 
50 55 60 

Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn 
65 70 75 80 

Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp 
85 90 95 

Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala lie 
100 105 110 

Ser Val Met Asn Gin Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly 
145 150 155 160 

Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 

Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val 
180 185 190 

Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu 
195 200 205 

Glu Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Arg Pro Gly Asp Arg 
210 215 220 

Val Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu 
225 230 235 240 

Thr Phe Leu Asp Arg Asp Glu Gly Ala Lys Lys Val Phe Tyr Val lie 
245 250 255 

Glu Thr Leu Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu 
260 265 270 

Leu Phe Val Ala Pro His Asn Asp Ser Gly Pro Thr Pro Gly Pro Ser 
275 280 285 

Ala Leu Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val 
290 295 300 
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10 



45 



Ala Glu Arg Gly Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser 
305 310 315 320 

Val Thr Leu Arg Glu Glu Glu Ala Gly Ala Tyr Ala Pro Leu Thr Ala 
325 330 335 

His Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val 
340 345 350 

lie Glu Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu 
355 360 365 



Ala His Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Gly Gly 
15 370 375 380 

Gly Gly Gly Ser lie Pro Ala Ala Gin Ser Ala Thr Glu Ala Arg Gly 
385 390 395 400 

20 Ala Glu Pro Thr Ala Gly lie His Trp Tyr Ser Gin Leu Leu Tyr His 

405 410 415 

lie Gly Thr Trp Leu Leu Asp . Ser Glu Thr Met His Pro Leu Gly Met 
« 420 425 430 

H25 

fy Ala Val Lys Ser Ser 

ffj30 (2) INFORMATION FOR SEQ ID NO : 14 : 

« (i) SEQUENCE CHARACTERISTICS: 
y. (A) LENGTH: 418 amino acids 

y, (B) TYPE: amino acid 

:- s 35 (D) TOPOLOGY: linear 

m 

□ 

°40 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Arg Leu Leu Thr Arg Val Leu Leu Val Ser Leu Leu Thr Leu Ser 
15 10 15 

Leu Val Val Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Tyr Gly Arg 
20 25 30 

Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
50 35 40 45 

Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

55 Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 1 70 75 80 



60 



Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 
85 90 95 
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Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ser Leu Ala lie Ser 
100 105 110 

Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

Asp Glu Asp Gly His His Phe Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

Ala Val Asp lie Thr Thr Ser Asp Arg Asp Lys Ser Lys Tyr Gly Thr 
145 150 155 160 

Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 
165 170 175 

Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu Val Ser Leu Gin 
195 200 205 

Asp Gly Gly Gin Lys Ala Val Lys Asp Leu Asn Pro Gly Asp Lys Val 
210 215 220 

Leu Ala Ala Asp Ser Ala Gly Asn Leu Val Phe Ser Asp Phe lie Met 
225 230 235 240 

Phe Thr Asp Arg Asp Ser Thr Thr Arg Arg Val Phe Tyr Val lie Glu 
245 250 255 

Thr Gin Glu Pro Val Glu Lys lie Thr Leu Thr Ala Ala His Leu Leu 
260 265 270 

Phe Val Leu Asp Asn Ser Thr Glu Asp Leu His Thr Met Thr Ala Ala 
275 280 285 

Tyr Ala Ser Ser Val Arg Ala Gly Gin Lys Val Met Val Val Asp Asp 
290 295 300 

Ser Gly Gin Leu Lys Ser Val lie Val Gin Arg lie Tyr Thr Glu Glu 
305 310 315 320 

Gin Arg Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He Val Val 
325 330 335 

Asp Arg He Leu Ala Ser Cys Tyr Ala Val He Glu Asp Gin Gly Leu 
340 345 350 

Ala His Leu Ala Phe Ala Pro Ala Arg Leu Tyr Tyr Tyr Val Ser Ser 
355 360 365 

Phe Leu Ser Pro Lys Thr Pro Ala Val Gly Pro Met Arg Leu Tyr Asn 
370 375 380 

Arg Arg Gly Ser Thr Gly Thr Pro Gly Ser Cys His Gin Met Gly Thr 
385 390 395 400 

Trp Leu Leu Asp Ser Asn Met Leu His Pro Leu Gly Met Ser Val Asn 
405 410 415 
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Ser Ser 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Leu Leu Leu Ala Arg Cys Leu Leu Leu Val Leu Val Ser Ser Leu 
1 5 10 15 

Leu Val Cys Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly Lys 
20 25 30 

Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
35 40 45 

Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

Lys lie Ser Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 
85 90 95 

Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala lie Ser 
100 105 110 

Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly Met 
145 150 155 160 

Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 
165 170 175 

Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu Glu 
195 200 205 

Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly Asp Arg Val 
210 215 220 

Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu Thr 
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225 230 235 240 

Phe Leu Asp Arg Asp Asp Gly Ala Lys Lys Val Phe Tyr Val lie Glu 
245 250 255 

Thr Arg Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu Leu 
260 265 270 

Phe Val Ala Pro His Asn Asp Ser Ala Thr Gly Glu Pro Glu Ala Ser 
275 280 285 

Ser Gly Ser Gly Pro Pro Ser Gly Gly Ala Leu Gly Pro Arg Ala Leu 
290 295 300 

Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val Ala Glu 
305 310 315 320 

Arg Asp Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser Val Thr 
325 330 335 

Leu Ser Glu Glu Ala Ala Gly Ala Tyr Ala Pro Leu Thr Ala Gin Gly 
340 345 350 

Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val lie Glu 
355 360 365 

Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu Ala His 
370 375 380 

Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Arg Gly Gly Asp 
385 390 395 400 

Ser Gly Gly Gly Asp Arg Gly Gly Gly Gly Gly Arg Val Ala Leu Thr 
405 410 415 

Ala Pro Gly Ala Ala Asp Ala Pro Gly Ala Gly Ala Thr Ala Gly lie 
420 425 430 

His Trp Tyr Ser Gin Leu Leu Tyr Gin lie Gly Thr Trp Leu Leu Asp 
435 440 445 

Ser Glu Ala Leu His Pro Leu Gly Met Ala Val Lys Ser Ser Xaa Ser 
450 455 460 

Arg Gly Ala Gly Gly Gly Ala Arg Glu Gly Ala 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Met Ser Pro Ala Arg Leu Arg Pro Arg Leu His Phe Cys Leu Val Leu 
15 10 15 
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10 



25 



40 



55 



Leu Leu Leu Leu Val Val Pro Ala Ala Trp Gly Cys Gly Pro Gly Arg 
20 25 30 

Val Val Gly Ser Arg Arg Arg Pro Pro Arg Lys Leu Val Pro Leu Ala 
35 40 45 

Tyr Lys Gin Phe Ser Pro Asn Val Pro Glu Lys Thr Leu Gly Ala Ser 
50 55 60 

Gly Arg Tyr Glu Gly Lys lie Ala Arg Ser Ser Glu Arg Phe Lys Glu 
65 70 75 80 



Leu Thr Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn 
15 85 90 95 

Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn 
100 105 110 

20 Ser Leu Ala lie Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg 
115 120 125 



Val Thr Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu 

130 135 140 

His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg 

145 150 155 160 



Asn Lys Tyr Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp 
30 165 170 175 . 

Trp Val Tyr Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser 
180 185 190 

35 Glu His Ser Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala 
195 200 205 



Gin Val Arg Leu Glu Ser Gly Ala Arg Val Ala Leu Ser Ala Val Arg 
210 215 220 

Pro Gly Asp Arg Val Leu Ala Met Gly Glu Asp Gly Ser Pro Thr Phe 

225 230 235 240 



Ser Asp Val Leu lie Phe Leu Asp Arg Glu Pro His Arg Leu Arg Ala 
45 245 250 255 

Phe Gin Val lie Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr 
260 265 270 

50 Pro Ala His Leu Leu Phe Thr Ala Asp Asn His Thr Glu Pro Ala Ala 
275 280 285 



Arg Phe Arg Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val 
290 295 300 

Leu Val Ala Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val 
305 310 315 320 



Ser Thr His Val Ala Leu Gly Ala Tyr Ala Pro Leu Thr Lys His Gly 
60 325 330 335 
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Thr Leu Val Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala 
340 345 350 

Asp His His Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe His 
355 360 365 

Ser Leu Ala Trp Gly Ser Trp Thr Pro Gly Glu Gly Val His Trp Tyr 
370 375 380 

Pro Gin Leu Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Gly Ser 
385 390 395 400 

Phe His Pro Leu Gly Met Ser Gly Ala Gly Ser 
405 410 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 96 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ala Leu Leu Thr Asn Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
1 5 10 15 

Ala Leu Pro Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 25 30 

Arg Arg Tyr Ala Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

Val Pro Gly Val Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

Gly Arg Val Ala Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 
85 90 95 

Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala lie 
100 105 110 

Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Leu Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 150 155 160 

Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 
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Glu Ser Arg Asn His Val His Val Ser Val Lys Ala Asp Asn Ser Leu 
180 185 190 

Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr Val Arg Leu 
195 200 205 

Trp Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Arg Gly Asp Trp 
210 215 220 

Val Leu Ala Ala Asp Ala Ser Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Ala Val 
245 250 255 

Glu Thr Glu Trp Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
260 265 270 

Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 300 

Gly Asp Ala Leu Arg Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 
305 310 315 320 

Ala Val Gly Val Phe Ala Pro Leu Thr Ala His Gly Thr Leu Leu Val 
325 330 335 

Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
340 345 350 

Ala His Arg Ala Phe Ala Pro Leu Arg Leu Leu His Ala Leu Gly Ala 
355 360 365 

Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 
370 375 380 

Arg Leu Leu Tyr Arg Leu Ala Glu Glu Leu Leu Gly 
385 390 395 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

Met Asp Val Arg Leu His Leu Lys Gin Phe Ala Leu Leu Cys Phe lie 
15 10 15 

Ser Leu Leu Leu Thr Pro Cys Gly Leu Ala Cys Gly Pro Gly Arg Gly 
20 25 30 
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10 



J2 

25 



in 



u 40 



55 



Tyr Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

Gin Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Lys 
50 55 60 

Tyr Glu Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu lie 
65 70 75 80 

Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Asn 
85 90 95 



Ala Asp Arg Leu Met Thr Lys Arg Cys Lys Asp Lys Leu Asn Ser Leu 
15 100 105 110 

Ala lie Ser Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr 

115 120 125 

20 Glu Gly Trp Asp Glu Asp Gly His His Leu Glu Glu Ser Leu His Tyr 
130 135 140 



Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Lys Ser Lys 

145 150 155 160 

Tyr Gly Met Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 

165 170 175 



J3 Tyr Tyr Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn 

01 30 180 185 190 

7 Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Gly Thr Val 

j\ 195 200 205 

35 Thr Leu Gly Asp Gly Thr Arg Lys Pro lie Lys Asp Leu Lys Val Gly 

ilJ 210 215 220 



Asp Arg Val Leu Ala Ala Asp Glu Lys Gly Asn Val Leu lie Ser Asp 
225 230 235 240 

Phe lie Met Phe lie Asp His Asp Pro Thr Thr Arg Arg Gin Phe lie 
245 250 255 



Val lie Glu Thr Ser Glu Pro Phe Thr Lys Leu Thr Leu Thr Ala Ala 
45 260 265 270 

His Leu Val Phe Val Gly Asn Ser Ser Ala Ala Ser Gly He Thr Ala 
275 280 285 

50 Thr Phe Ala Ser Asn Val Lys Pro Gly Asp Thr Val Leu Val Trp Glu 
290 295 300 



Asp Thr Cys Glu Ser Leu Lys Ser Val Thr Val Lys Arg He Tyr Thr 
305 310 315 320 

Glu Glu His Glu Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He 
325 330 335 



He Val Asp Gin Val Leu Ala Ser Cys Tyr Ala Val He Glu Asn His 
60 340 345 350 
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Lys Trp Ala His 
355 

Met Thr Trp Leu 
370 

Asp Gly lie His 
385 

Leu Leu Asp Arg 



Trp Ala Phe Ala 
360 

Phe Pro Ala Arg 
375 

Trp Tyr Ser Asn 
390 

Asp Ser Phe His 
405 



Pro Val Arg Leu 



Glu Ser Asn Val 
380 

Met Leu Phe His 
395 

Pro Leu Gly lie 
410 



Cys His Lys Leu 
365 

Asn Phe Gin Glu 



lie Gly Ser Trp 
400 

Leu His Leu Ser 
415 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..1413 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG GAT AAC CAC AGC TCA GTG CCT TGG GCC AGT GCC GCC AGT GTC ACC 
Met Asp Asn His Ser Ser Val Pro Trp Ala Ser Ala Ala Ser Val Thr 
1 5 10 15 

TGT CTC TCC CTG GGA TGC CAA ATG CCA CAG TTC CAG TTC CAG TTC CAG 
Cys Leu Ser Leu Gly Cys Gin Met Pro Gin Phe Gin Phe Gin Phe Gin 
20 25 30 

CTC CAA ATC CGC AGC GAG CTC CAT CTC CGC AAG CCC GCA AGA AGA ACG 
Leu Gin lie Arg Ser Glu Leu His Leu Arg Lys Pro Ala Arg Arg Thr 
35 40 45 

CAA ACG ATG CGC CAC ATT GCG CAT ACG CAG CGT TGC CTC AGC AGG CTG 
Gin Thr Met Arg His lie Ala His Thr Gin Arg Cys Leu Ser Arg Leu 
50 55 60 

ACC TCT CTG GTG GCC CTG CTG CTG ATC GTC TTG CCG ATG GTC TTT AGC 
Thr Ser Leu Val Ala Leu Leu Leu He Val Leu Pro Met Val Phe Ser 
65 70 75 80 

CCG GCT CAC AGC TGC GGT CCT GGC CGA GGA TTG GGT CGT CAT AGG GCG 
Pro Ala His Ser Cys Gly Pro Gly Arg Gly Leu Gly Arg His Arg Ala 
85 90 95 

CGC AAC CTG TAT CCG CTG GTC CTC AAG CAG ACA ATT CCC AAT CTA TCC 
Arg Asn Leu Tyr Pro Leu Val Leu Lys Gin Thr He Pro Asn Leu Ser 
100 105 110 
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GAG TAC ACG AAC AGC GCC TCC GGA CCT CTG GAG GGT GTG ATC CGT CGG 
Glu Tyr Thr Asn Ser Ala Ser Gly Pro Leu Glu Gly Val lie Arg Arg 
115 120 125 

GAT TCG CCC AAA TTC AAG GAC CTC GTG CCC AAC TAC AAC AGG GAC ATC 
Asp Ser Pro Lys Phe Lys Asp Leu Val Pro Asn Tyr Asn Arg Asp lie 
130 135 140 

CTT TTC CGT GAC GAG GAA GGC ACC GGA GCG GAT GGC TTG ATG AGC AAG 
Leu Phe Arg Asp Glu Glu Gly Thr Gly Ala Asp Gly Leu Met Ser Lys 
145 150 155 160 



CGC TGC AAG GAG AAG CTA AAC GTG CTG GCC TAC TCG GTG 
Arg Cys Lys Glu Lys Leu Asn Val Leu Ala Tyr Ser Val 
165 170 



ATG AAC GAA 
Met Asn Glu 
175 



TGG CCC GGC ATC CGG CTG CTG GTC ACC GAG AGC TGG GAC GAG GAC TAC 
Trp Pro Gly lie Arg Leu Leu Val Thr Glu Ser Trp Asp Glu Asp Tyr 
180 185 190 

CAT CAC GGC CAG GAG TCG CTC CAC TAC GAG GGC CGA GCG GTG ACC ATT 
His His Gly Gin Glu Ser Leu His Tyr Glu Gly Arg Ala Val Thr He 
195 200 205 

GCC ACC TCC GAT CGC GAC CAG TCC AAA TAC GGC ATG CTC GCT CGC CTG 
Ala Thr Ser Asp Arg Asp Gin Ser Lys Tyr Gly Met Leu Ala Arg Leu 
210 215 220 

GCC GTC GAG GCT GGA TTC GAT TGG GTC TCC TAC GTC AGC AGG CGC CAC 
Ala Val Glu Ala Gly Phe Asp Trp Val Ser Tyr Val Ser Arg Arg His 
225 230 235 240 

ATC TAC TGC TCC GTC AAG TCA GAT TCG TCG ATC AGT TCC CAC GTG CAC 
lie Tyr Cys Ser Val Lys Ser Asp Ser Ser He Ser Ser His Val His 
245 250 255 

GGC TGC TTC ACG CCG GAG AGC ACA GCG CTG CTG GAG AGT GGA GTC CGG 
Gly Cys Phe Thr Pro Glu Ser Thr Ala Leu Leu Glu Ser Gly Val Arg 
260 265 270 

AAG CCG CTC GGC GAG CTC TCT ATC GGA GAT CGT GTT TTG AGC ATG ACC 
Lys Pro Leu Gly Glu Leu Ser He Gly Asp Arg Val Leu Ser Met Thr 
275 280 285 

GCC AAC GGA CAG GCC GTC TAC AGC GAA GTG ATC CTC TTC ATG GAC CGC 
Ala Asn Gly Gin Ala Val Tyr Ser Glu Val He Leu Phe Met Asp Arg 
290 295 300 

AAC CTC GAG CAG ATG CAA AAC TTT GTG CAG CTG CAC ACG GAC GGT GGA 
Asn Leu Glu Gin Met Gin Asn Phe Val Gin Leu His Thr Asp Gly Gly 
305 310 315 320 



GCA GTG CTC ACG GTG ACG CCG GCT CAC CTG GTT AGC GTT 
Ala Val Leu Thr Val Thr Pro Ala His Leu Val Ser Val 
325 330 



TGG CAG CCG 
Trp Gin Pro 
335 



GAG AGC CAG AAG CTC ACG TTT GTG TTT GCG CAT CGC ATC GAG GAG AAG 
Glu Ser Gin Lys Leu Thr Phe Val Phe Ala His Arg He Glu Glu Lys 
340 345 350 
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AAC CAG GTG CTC GTA CGG GAT GTG GAG ACG GGC GAG CTG AGG CCC CAG 
Asn Gin Val Leu Val Arg Asp Val Glu Thr Gly Glu Leu Arg Pro Gin 
355 360 365 

CGA GTG GTC AAG TTG GGC AGT GTG CGC AGT AAG GGC GTG GTC GCG CCG 
Arg Val Val Lys Leu Gly Ser Val Arg Ser Lys Gly Val Val Ala Pro 
370 375 380 

CTG ACC CGC GAG GGC ACC ATT GTG GTC AAC TCG GTG GCC GCC AGT TGC 
Leu Thr Arg Glu Gly Thr lie Val Val Asn Ser Val Ala Ala Ser Cys 
385 390 395 400 

TAT GCG GTG ATC AAC AGT CAG TCG CTG GCC CAC TGG GGA CTG GCT CCC 
Tyr Ala Val lie Asn Ser Gin Ser Leu Ala His Trp Gly Leu Ala Pro 
405 410 415 

ATG CGC CTG CTG TCC ACG CTG GAG GCG TGG CTG CCC GCC AAG GAG CAG 
Met Arg Leu Leu Ser Thr Leu Glu Ala Trp Leu Pro Ala Lys Glu Gin 
420 425 430 

TTG CAC AGT TCG CCG AAG GTG GTG AGC TCG GCG CAG CAG CAG AAT GGC 
Leu His Ser Ser Pro Lys Val Val Ser Ser Ala Gin Gin Gin Asn Gly 
435 440 445 

ATC CAT TGG TAT GCC AAT GCG CTC TAG AAG GTC AAG GAC TAC GTG CTG 
lie His Trp Tyr Ala Asn Ala Leu Tyr Lys Val Lys Asp Tyr Val Leu 
450 455 460 

CCG CAG AGC TGG CGC CAC GAT TGA 
Pro Gin Ser Trp Arg His Asp 
465 470 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Met Asp Asn. His Ser Ser Val Pro Trp Ala Ser Ala Ala Ser Val Thr 
1 5 10 15 

Cys Leu Ser Leu Gly Cys Gin Met Pro Gin Phe Gin Phe Gin Phe Gin 
20 25 30 

Leu Gin lie Arg Ser Glu Leu His Leu Arg Lys Pro Ala Arg Arg Thr 
35 40 45 

Gin Thr Met Arg His lie Ala His Thr Gin Arg Cys Leu Ser Arg Leu 
50 55 60 

Thr Ser Leu Val Ala Leu Leu Leu lie Val Leu Pro Met Val Phe Ser 
65 70 75 80 

Pro Ala His Ser Cys Gly Pro Gly Arg Gly Leu Gly Arg His Arg Ala 
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Arg Asn Leu Tyr 
100 

Glu Tyr Thr Asn 
115 

Asp Ser Pro Lys 
130 

Leu Phe Arg Asp 
145 

Arg Cys Lys Glu 



Trp Pro Gly lie 
180 

His His Gly Gin 
195 

Ala Thr Ser Asp 
210 

Ala Val Glu Ala 
225 

lie Tyr Cys Ser 



Gly Cys Phe Thr 
260 

Lys Pro Leu Gly 
275 

Ala Asn Gly Gin 
290 

Asn Leu Glu Gin 
305 

Ala Val Leu Thr 



Glu Ser Gin Lys 
340 

Asn Gin Val Leu 
355 

Arg Val Val Lys 
370 

Leu Thr Arg Glu 
385 

Tyr Ala Val He 



85 

Pro Leu Val Leu 



Ser Ala Ser Gly 
120 

Phe Lys Asp Leu 
135 

Glu Glu Gly Thr 
150 

Lys Leu Asn Val 
165 

Arg Leu Leu Val 



Glu Ser Leu His 
200 

Arg Asp Gin Ser 
215 

Gly Phe Asp Trp 
230 

Val Lys Ser Asp 
245 

Pro Glu Ser Thr 



Glu Leu Ser He 
280 

Ala Val Tyr Ser 
295 

Met Gin Asn Phe 
310 

Val Thr Pro Ala 
325 

Leu Thr Phe Val 



Val Arg Asp Val 
360 

Leu Gly Ser Val 
375 

Gly Thr He Val 
390 

Asn Ser Gin Ser 



90 

Lys Gin Thr He 
105 

Pro Leu Glu Gly 



Val Pro Asn Tyr 
140 

Gly Ala Asp Gly 
155 . 

Leu Ala Tyr Ser 
170 

Thr Glu Ser Trp 
185 

Tyr Glu Gly Arg 



Lys Tyr Gly Met 
220 

Val Ser Tyr Val 
235 

Ser Ser He Ser 
250 

Ala Leu Leu Glu 
265 

Gly Asp Arg Val 



Glu Val He Leu 
300 

Val Gin Leu His 
315 

His Leu Val Ser 
330 

Phe Ala His Arg 
345 

Glu Thr Gly Glu 



Arg Ser Lys Gly 
380 

Val Asn Ser Val 
395 

Leu Ala His Trp 



95 

Pro Asn Leu Ser 
110 

Val He Arg Arg 
125 

Asn Arg Asp He 



Leu Met Ser Lys 
160 

Val Met Asn Glu 
175 

Asp Glu Asp Tyr 
190 

Ala Val Thr He 
205 

Leu Ala Arg Leu 



Ser Arg Arg His 
240 

Ser His Val His 
255 

Ser Gly Val Arg 
270 

Leu Ser Met Thr 
285 

Phe Met Asp Arg 



Thr Asp Gly Gly 
320 

Val Trp Gin Pro 
335 

He Glu Glu Lys 
350 

Leu Arg Pro Gin 
365 

Val Val Ala Pro 



Ala Ala Ser Cys 
400 

Gly Leu Ala Pro 
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405 410 415 

Met Arg Leu Leu Ser Thr Leu Glu Ala Trp Leu Pro Ala Lys Glu Gin 
420 425 430 

Leu His Ser Ser Pro Lys Val Val Ser Ser Ala Gin Gin Gin Asn Gly 
435 440 445 

lie His Trp Tyr Ala Asn Ala Leu Tyr Lys Val Lys Asp Tyr Val Leu 
450 455 460 

Pro Gin Ser Trp Arg His Asp 
465 470 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Cys Gly Pro Gly Arg Gly Xaa Gly Xaa Arg Arg His Pro Lys Lys Leu 
15 10 15 

Thr Pro Leu Ala Tyr Lys Gin Phe lie Pro Asn Val Ala Glu Lys Thr 
20 25 30 

Leu Gly Ala Ser Gly Arg Tyr Glu Gly Lys lie Xaa Arg Asn Ser Glu 
35 40 45 

Arg Phe Lys Glu Leu Thr Pro Asn Tyr Asn Pro Asp lie lie Phe Lys 
50 55 60 

Asp Glu Glu Asn Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys Lys 
65 70 75 80 

Asp Lys Leu Asn Xaa Leu Ala lie Ser Val Met Asn Xaa Trp Pro Gly 
85 90 95 

Val Xaa Leu Arg Val Thr Glu Gly Trp Asp Glu Asp Gly His His Xaa 
100 105 110 

Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr Ser 
115 120 125 

Asp Arg Asp Xaa Ser Lys Tyr Gly Xaa Leu Xaa Arg Leu Ala Val Glu 
130 135 140 

Ala Gly Phe Asp Trp Val Tyr Tyr Glu Ser Lys Ala His lie His Cys 
145 150 155 160 

Ser Val Lys Ala Glu Asn Ser Val Ala Ala Lys Ser Gly Gly Cys Phe 
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165 170 175 

Pro Gly Ser Ala Xaa Val Xaa Leu Xaa Xaa Gly Gly Xaa Lys Xaa Val 
180 185 190 

Lys Asp Leu Xaa Pro Gly Asp Xaa Val Leu Ala Ala Asp Xaa Xaa Gly 
195 200 205 

Xaa Leu Xaa Xaa Ser Asp Phe Xaa Xaa Phe Xaa Asp Arg 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Cys Gly Pro Gly Arg Gly Xaa Xaa Xaa Arg Arg Xaa Xaa Xaa Pro Lys 
15 10 15 

Xaa Leu Xaa Pro Leu Xaa Tyr Lys Gin Phe Xaa Pro Xaa Xaa Xaa Glu 
20 25 30 

Xaa Thr Leu Gly Ala Ser Gly Xaa Xaa Glu Gly Xaa Xaa Xaa Arg Xaa 
35 40 45 

Ser Glu Arg Phe Xaa Xaa Leu Thr Pro Asn Tyr Asn Pro Asp lie lie 
50 55 60 

Phe Lys Asp Glu Glu Asn Xaa Gly Ala Asp Arg Leu Met Thr Xaa Arg 
65 70 75 80 

Cys Lys Xaa Xaa Xaa Asn Xaa Leu Ala lie Ser Val Met Asn Xaa Trp 
85 90 95 

Pro Gly Val Xaa Leu Arg Val Thr Glu Gly Xaa Asp Glu Asp Gly His 
100 105 110 

His Xaa Xaa Xaa Ser Leu His Tyr Glu Gly Arg Ala Xaa Asp lie Thr 
115 120 125 

Thr Ser Asp Arg Asp Xaa Xaa Lys Tyr Gly Xaa Leu Xaa Arg Leu Ala 
130 135 140 

Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu Ser Xaa Xaa His Xaa 
145 150 155 160 

His Xaa Ser Val Lys Xaa Xaa 
165 
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